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i!lEARY_SCREENING_METHOD 

Description 

i££i£££HSi_£f_the_Invention 

A genomic DNA "library" is formed by digesting 
5 genomic DNA from a particular organism with a suitable 
restriction enzyme, joining the genomic DNA fragments to 
vectors and introducing the DNA fragment- containing 
vectors into a population of host cells. Complementary 
DNA (cDNA) is DNA which has been produced by an enzyme 

_kn^n_as_r.e.v.e.r. S .e_t.r.a.nsc-r-i-p-t-a-se 

complementary strand of DNA ( cDNA) using a mRNA strand as 
a template. A c DNA library is formed by joining the cDNA 
fragments to vectors and introducing the cDNA fragment- 
containing vectors into a population of host cells. 

In a DNA or cDNA library, the pieces of DNA exist as 
an unordered collection of thousands or millions of 
Pieces. To isolate a host cell carrying a specific DNA 
sequence (i.e., a specific DNA clone), the entire library 
must be screened. Radioactively labeled or otherwise 
labeled nucleic acid probes are traditionally employed to 
screen a DNA or cDNA library. Nucleic acid probes 
identify a specific DNA sequence by a process of in vitro 
hybridization between complementary DNA sequences In the 
probe and the DNA clone. 

A specific DNA clone that has been identified and 
isolated in this manner can contain DNA that is con- 
tiguous to the probe sequence. A terminus of the DNA 
clone, therefore, can be used as a new probe to rescreen 
the same or another DNA library to obtain a second DNA 
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clone which has an overlapping sequence with the first 
DNA clone. By obtaining a set of overlapping DNA clones, 
a physical map of a genomic region on a chromosome may be 
constructed. This process is called "chromosome walking" 

5 because each overlapping DNA clone which is isolated is 
one step further along the chromosome. Each DNA clone 
also can be studied to determine its genetic relationship 
to a previously mapped genetic function and, thus, a 
series of overlapping DNA clones provides a physical map 

X0 of a chromosome which may be correlated to a map of 
genetic functions. 



Chromosome walking is used, for example, to identify 
or localize a gene of interest, such as one thought to be 
causative of or associated with a disease or other 

15 condition, phenotype or quantitative trait. This is done 
by using a DNA fragment which displays a restriction 
fragment length polymorphism (RFLP) shown to be genetic- 
ally linked to (i.e., physically localized to the same 
chromosome region as) a gene which causes or is 

20 associated with a disease, or other condition, phenotype, 
or quantitative trait or a segment of DNA contiguous to 
such a RFLP or a cDNA , as an in vitro hybridization probe 
to screen a DNA library and pull out larger fragments of 
DNA in which all or part of the probe sequence is repre- 

25 sented. 

The usefulness of any DNA clone isolated in this 
manner is that it includes DNA that is contiguous to the 
RFLP sequence that is incrementally closer to the 
position of the sought-after gene than the original RFLP. 
30 To get a step closer, a labeled molecule corresponding to 
an end of the newly isolated DNA clone is prepared and 
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used to rescreen the library, with the goal being to 
isolate DNA clones that overlap with sequences found in 
the first DNA clone and that are incrementally closer to 
the gene of interest than either the starting probe or 
the first DNA clone isolated. This procedure is repeated 
as needed, with the resulting DNA clones being used in 
genetic studies to assess whether they are more closely 
linked to the gene of interest. To walk over a distance 
of 10 million base pairs using presently- available 
chromosome walking techniques could require from 100 to 
__JL^>00^e,p,s,^ veetpr 
used. Any approach designed to decrease the work 
required to take a single walking step or which would 
allow multiple walking projects to be carried out simul- 
taneously would be a major advance. 

The number of DNA clones which would be required to 
form a complete library of genomic DNA is determined by 
the srze of the genome and the DNA clone capacity of the 
vector used to clone and propagate the segments of the 

DNA f c — «« -«...!.. of genomic DNA 

libraries of organisms with large genomes is labor 
-tensive and time consuming. The development of vectors 
hav.ng a capacity for large DNA clones has helped to 
reduce the labor involved in screening genomic libraries 

x:::r:; t :::::: in8 libraries «- «* - 



3 



Sumraary_of_the_Inyention 

The pr es <,„ t l„„.„tio„ j. . „„„„„ „ f ld .„ 

a MA fr.,..„ t of ln „ rest ( . , ' "* 

30 £«„.«>. . DMA llbt „, . . uk „ yotic 
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host cell, which is based on homologous recombination 
between the target DNA fragment and DNA present in a 
targeting DNA molecule introduced into the DNA fragment 
library. It further relates to targeting vectors and DNA 
5 fragment libraries constructed in eukaryotic host cells 
as described herein. 

The method of the present invention is used to 
screen a DNA fragment library constructed in a eukaryotic 
host cell in which genetic recombination (exchange of 
10 information between DNA present in an artificial unit (or 
eplsome) or in a chromosome in the host cell and DNA 
introduced into the host cell) occurs by means of homo- 
logous recombination. In eukaryotic host cells, DNA 
fragments are propagated in the form of an episome or 
15 other artificial unit which is replicatable in the 

eukaryotic host cell. The episome or artificial unit 
includes, in addition to the DNA fragment, sequences 
which can be used for propagation in bacteria, one or 
more marker genes for selection in bacteria and one or 
20 more marker genes for selection in the eukaryotic host 
cells . 

In one embodiment of the present method, in which 
the eukaryotic host cell is yeast, genetic recombination 
occurs essentially exclusively by homologous recombi- 
25 nation. DNA fragments in host cells are propagated in 
the form of artificial chromosomes which include, in 
addition to a DNA fragment insert, all of the DNA 
sequences necessary for the chromosome to participate in 
host cell replication and mitotic segregation in a manner 
30 similar to that of naturally-present host cell chromo- 
somes. In general, the artificial chromosome is present 
in one copy or low-copy number in a host cell. 



I. 
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The present method makes use of a targeting vector 
or vehicle which: 1) includes a DNA sequence, referred 
to as targeting DNA. homologous to at least a portion of 
the target DNA fragment and a selectable marker gene 
5 which is functional in host cells under appropriate con- 
ditions and 2) is non-replicating in the host cell 
Targeting DNA can be any DNA sequence, including genomic 
DNA. cDNA and DNA synthesized using known techniques 
Preferably a double - strand break is made in the targeting 
10 DNA present in the targeting vector, which generally is 

_circu l^ r _w,he.n-p.u r-i -f-i-ed-f-r-o *-a n^-go^p h 0 s t . Alterna 

txvely, a gap can be introduced by .making: two cuts in the 
targeting DNA (e.g.., wlth appropriately selected restric- 
ton en 2 yme(s)). The break or gap renders the vector 
15 linear, provides DNA ends which stimulate homologous 
recombination with host cell artificial chromosome 
sequences and increases the efficiency of stable trans- 
formation by homologous recombination. 

The targeting vector is introduced into cells 
10 harboring the DNA fragment library, producing a mixed 
population of host cells, some of which contain the 
targeting vector and some of which do not. The resulting 
population of host cells i « • , 8 

6118 ls attained under conditions 
appropriate for homologous recombination between DNA 
already present in the cell (i.e., prior to introduction 

h se :r: h g vector> and such 

those in the targeting vector. Subsequently, the 
Population of cells is subjected to conditions appro- 
prxate for selection of host cells in which homologous 
30 recombination has occurred. Because the targeting" t or 
" Unat>le t0 repliCate ^ ^e host cell, stable trans- 
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formation with the selectable marker gene can occur only 
through homologous recombination. The selectable marker 
gene is replicated and, therefore, confers a stable 
phenotype, only in host cells in which homologous 
recombination with sequences that are replicatable in the 
host has occurred. Identification of such host 
cells--and, thus, of host cells containing the target DNA 
fragment of interest--is carried out by culturing the 
population of host cells under conditions (e.g., 
culturing on appropriate media) in which only those host 
— c e4-l-s— i-n— wh-i-ch— h omo-l-og o us - reiSomb~i nation (and stable 
transformation) occurred can survive. Growth of a 
transformed host cell is indicative of the presence of 
the target DNA fragment. Host cells containing a target 
15 DNA fragment are, as a result, separated or isolated from 
host cells which do not contain the target DNA fragment. 
The target DNA fragment can be removed from the host cell 
and sequenced or manipulated (e.g., subcloned or mapped), 
using known techniques. 
20 Alternatively, targeting DNA and a selectable marker 

gene for selection in yeast can be introduced into yeast 
cells containing the DNA fragment library by mating a 
yeast strain containing the targeting DNA and the select- 
able marker gene on a targeting vehicle which is a 
25 replicating yeast linear plasmid with the yeast host 
cells containing the library. I„ this embodiment, the 
two yeast strains must be of opposite mating types. 
Homologous recombination occurs between the targeting 
linear plasmid and a library YAC having DNA homologous to 
30 targeting DNA. producing two linear molecules, each of 
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which is a YAC. In one embodiment, the linear plasmid 
has negatively selectable markers flanking the targeting 
DNA sequence. Each of the two recombination products 
carries one of the two negatively selectable markers, 
5 making differential selection of the two recombination 
products possible. In another embodiment of the method 
in which mating of opposite mating type yeast strains is 
used, a first yeast strain containing a yeast replicating 
plasmid. constructed in such a manner that the targeting 
10 DNA and a first selectable marker gene can be freed from 
^h^as.t-r-ep^e-on-by-re.on.b-^t lon events and a second 
selectable marker gene, which is a negatively selectable 
marker gene, is used to select the replicon itself. When 
thxs strain is mated to all members of a YAC library the 
freed targeting sequence can undergo recombination with 
YAC molecules within the library. 

The replicating yeast plasmids described above can 
also be introduced into host cells containing YACs by 
transformation . 

In a preferred embodiment, the DNA fragment library 
is constructed in yeast, such as Saccharomvces (S ) 
cerevisiae or Schi.osaccharomyces (,., ^mbl, in which 
DNA fragments are present in yeast artificial chromosomes 
(YAC) . Each yeast host cell contains one YAC or a few 
YACs each present in one or few copies. A YAC includes, 
in addition to a DNA fragment, all of the DNA sequences 
required for chromosomes to replicate in yeast, segregate 
chromosomes to their progeny and stabilize chromosome 
ends. In this embodiment, the targeting vector used is a 
bacterial plasmid or other vector which does not repli- 
cate in yeast and includes targeting DNA and a selectable 



20 



WO 93/03183 PCT/US91/08679 



-8- 



marker gene that functions in yeast. The targeting 
vector, which preferably has been linearized by intro- 
ducing a double-strand break within the targeting DNA of 
the bacterial plasmid, is introduced into yeast cells. 
5 The resulting mixed population of yeast cells is main- 
tained under conditions appropriate for homologous 
recombination to occur between targeting DNA and target 
DNA in the YAC . This is followed by selection of yeast 
cells stably transformed with the targeting DNA and 
10 selectable marker gene. Stable transformation of the 

yeast cells confers on them a s.e.l.e.c.table-ph.eno.ty-pe- i — suc-h- 

as antibiotic resistance, nutrient prototrophy (such as 
amino acid prototrophy or nucleoside prototrophy), 
tolerance to a metal ion, ability to progress through the 
15 cell cycle or expression of a cell surface marker. 

Growth of yeast cells under conditions compatible with 
survival only of stably transformed cells is indicative 
of the presence of the target DNA sequence. Target DNA 
can be removed from the yeast cell and sequenced or 
20 manipulated, using known techniques. 

The present invention also relates to targeting DNA 
molecules and vectors useful in the present method. 
Vectors include targeting vectors, such as bacterial 
plasmids which do not replicate in yeast and include 
25 targeting DNA and a selectable marker gene functional in 
yeast. They may also include a selectable marker gene 
for selection in bacteria. Additional targeting DNA 
molecules include replicating molecules, such as a yeast 
linear plasmid. 

30 YAC arm vectors useful in the present method are 

also the subject of the present invention. These include 
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a yeast selectable marker gene, a bacterial origin of 
replication, a bacterial selectable marker gene, a yeast 
telomere, and one or more cloning sites at which 
targeting DNA is introduced or inserted into the vector. 
In addition, YAC arm vectors can include yeast centromere 
sequences and/or a yeast replication origin. YAC arm 
vectors which are the subject of the present invention 
include those designated pTKENDA , pTKENDA2 , pTKENDB , 
pTKENDC , pTKENDD and their functional equivalents. 

The present invention further relates to eukaryotic 
host cells ,_p A r_t,i.c.u,l.ax.l.y-y.ea.s-t-ce-l-l-s— cons-t rucfrd-n 



described herein and useful for construction of YAC 
libraries from which a DNA fragment of interest can be 
identified and isolated by the claimed method. In 
addition, the present invention relates to DNA fragment 
libraries, particularly YAC libraries, constructed in 
such eukaryotic host cells. 

In one embodiment, a DNA fragment library is con- 
structed in a yeast host strain carrying a chromosomal 
deletion of four selectable marker genes (i.e., the four 
selectable marker genes normally present in the yeast 
strain genome have been deleted). The yeast strain has 
incorporated into it a pair of YAC arm vectors, each of 
which includes the following elements: a yeast select- 
25 able marker gene which is one of the four selectable 

marker genes deleted from the yeast host strain; a bac- 
terial origin of replication; a bacterial selectable 
marker gene and a yeast telomere. The yeast selectable 
marker gene in each member of the pair of YAC arm vectors 
30 " different from that present in the other member of the 
pair. In one embodiment, the yeast strain carries a 
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chromosomal deletion of the ARG4 gene, the TRP1 gene, the 
LEU 2 gene and the URA3 gene. A pair of YAC arm vectors 
can include any combination of pairs of these marker 4 
genes; each member of the pair includes a marker gene 
5 different from that included in the other member of the 
pair. In one embodiment, in which a two-library system 
is used, the yeast host strain carries a chromosomal 
deletion of four selectable marker genes and two pairs of 
YAC arm vectors are used, each carrying a selectable 
10 marker gene deleted from the yeast host strain and not 

. _ present in t he o ther member of the pair nf VAC arm 

vectors in which it is used. Such yeast host strains and 
YAC arm vectors are described in detail herein. 

The method, targeting vectors, YAC arm vectors and 
15 DNA fragment libraries of the present invention are 

useful for identifying and isolating a target DNA frag- 
ment, which can be genomic DNA or cDNA and can be an 
entire gene, gene portion or other DNA sequence. The DNA 
in DNA fragment libraries screened by this method can be 
20 of any type, such as, but not limited to, mammalian 
(particularly human), plant, insect, avian, fish, 
crustacean, molluscan, viral, nematode, amphibian, 
reptilian or protozoan. For example, they can be used to 
identify and isolate a gene associated with a particular 
25 disease, condition, phenotype, or quantitative trait, 
related genes within an organism's genome, and cDNA. 

Further, as described herein, physically contiguous 
DNA sequences can be identified in a YAC library in yeast 
cells (or other DNA fragment library) and used to con- 
30 struct a physical chromosome map. That is, the present 

method is useful for chromosome walking. In this embodi- 

i 
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* VAC "««'»in 5 . target DNA frag.ent t . 

Isolated, using the ,1.1.., hcologous r.co.bin.ti.n- 
based .ethod described h.r.l,. .,„ . ,.„,„„. o£ the 
fragment subclone*. In „„ y inst „ ces , boch 
5 .111 b. subclone* in order „ d . t6 „ in . the c< , rrect 
direction for th . „ lk „ tfc> ^ 

«... target DNA £,.,.„. i. then „.., „ th , 

»« present th . titg . tlns vectori vMch 8 

int. the VAC iibr.ry. A second ,„,« Y4C .. 

' r " ° f " ,h * «»«" »"» '«.««. uhich 

partially ov.riaps th . taT . 6 ^_ roA ^. rTa , CT 

" M """" " -bclon.d and „s.d as the targeting 
»A in a targeting vector in „ oauc . d liito - th< J • 

ibr.r, This r .s»l ts in ilol . tton of . ^ ^ 
<•"«»., . target ui fragment, „ hich parti . llv .„„. 
laps the second target DNA fragnent in seouence. This 
Process results in i,ol. tIo „ ef . .„,.. of yAc » 

tarnin, rarget C „A fragments vhich partially .v.rl.p ,„„ 
oa„ be repeated as n.ny tl „. ., ^ d £ . J 

I" ;" "" Sht - Ch "— •« ^ carri. 

<o Lb ' I >=»Sth polymorph!,. 

' '"!«•« contiguous to a KPLP or a eDKA 

•s targeting MA In the targeting vector e. 
5 Hbrarv i . vector to screen a VAC 

library. A t.r.rnua of tar ,. t ,„ lsol>tea 

nanner le eubclon.d or isolated and the resulting 
sequence used to Isolate a co„tig»o us DNA fragment Ihl s 
rs r. oft .. „.„ dea to . 

«P end, optimally, to reach a desired gen. 
for che RFLP i, associated. ' 
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The method of the subject invention has numerous 
advantages over other approaches to screening DNA 
libraries. For example, it is possible to screen a DNA 
fragment library many times, simultaneously. Libraries 

5 are stored as a pool of clones, thus eliminating the work 
needed to organize and screen a library that is distri- 
buted over many filter membranes. The labor needed to 
screen a library is considerably less than that needed 
with conventional methods. In addition, terminal 

10 sequences are isolated from YAC clones without the need 
for subci o n i~ng in a form s ui t able for subsequent— wa-l-king 
steps . 

Brief Description of _ the_Drawings 

Figure 1 illustrates the identification of target 

X5 DNA fragments in a YAC library by the homologous - 

recombination selection method of the present invention. 
The YAC includes telomeres (arrowheads), centromere/yeast 
origin of replication (filled circles), and a DNA frag- 
ment; in the case of clone #3, the DNA fragment contains 

20 within it a target DNA fragment (solid rectangle) . 

Figure 2 illustrates targeting (homologous recipro- 
cal recombination) to generate a YAC that is marked for 
selection. 

Figure 3 illustrates selection by homologous 
25 recombination of a DNA clone from a DNA YAC library using 
one-step gene disruption. 

Figure 4 illustrates selection of DNA clones by 
homologous recombination using two DNA YAC libraries. 

Figure 5 is a map of plasmid p 184DLARG . B: BamHI ; 
30 Sm: Smal; P: Pstl; ARG4: yeast ARG4 gene (arrow 
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indicates direction of transcription); Cm: chlor- 
amphenicol resistance gene; ORI (pACYC184): Origin of 

replication from pACYC184; ; hypothetical 

targeting sequence inserted into cloning site. 
Figure 6a is a plasmid map of pTKENDA . 
Figure 6b is a plasmid map of pTKENDB. 
Figure 6c is a plasmid map of pTKENDC . 
Figure 6d is a plasmid map of pTKENDD. 
Figure 7 is a restriction enzyme and Southern blot 
analysis of clones selected by targeting with human 
epsilon- and beta-globin sequences. 

Figure 8a contains oligonucleotides used in the 
construction of YAC arm vectors. The sequences in upper 
case letters indicate bases corresponding to oligonucleo- 
15 tides synthesized in vitro. The sequences in lower case 
letters indicate those bases filled in in vitro using 
each pair of annealed oligonucleotides. Relevant 
restriction enzyme recognition sequences are indicated. 
Figure 8b contains oligonucleotides used in the 
20 construction of YAC arm vectors. The sequences in upper 
case letters indicate bases corresponding to oligo- 
nucleotides synthesized in vitro. The sequences in lower 
case letters indicate those bases filled in in vitro 
using each pair of annealed oligonucleotides. Relevant 
restriction enzyme recognition sequences are indicated. 

Figure 8c contains oligonucleotides used in the 
construction of YAC arm vectors. The underlined base 
indicates the mutation from the wild-type sequence. 

Figure 9 is photograph of a restriction enzyme and 
Southern hybridization analysis of DNA from eight yeast 
colonies isolated by screening with fragment 8A . Lanes 



25 



30 
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1-4: clones 8A.1, 8A.2, 8A.3 and 8A.4; Lane 5: plasmid 
pl84-8A. Lanes 6-7: clones 8A.5 and 8A.6; Lane 8: an 
example of DNA from an isolated colony which does not 
show the unit- length- linear band; Lane 9: clone 8A.11. 

5 1 microgram of total yeast DNA was loaded in lanes 1-4 
and 5-9. 2 nanograms of plasmid pl84-8A was loaded in 
lane 5. The electrophoresed DNA samples (all digested 
with Kpnl) were transferred to a nylon membrane and 
hybridized with a 32-P labeled ARG4 DNA probe. The arrow 

in marks the position of the unit - length - linear band at 8.3 
kb. 

Figure 10 is a photograph of a* restriction enzyme 
and Southern hybridization analysis of DNA from each of 
the positive clones digested with Xhol and with either 

15 Kpnl (for those isolated by screening with fragment 8A) 
or Avail (for those isolated by screening with fragment 
10B). Samples were electrophoresed on a 1% agarose gel, 
transferred to a nylon filter, and hybridized with P 
labeled pBR328 (Boehringer Mannheim Biochemicals , 

20 Indianapolis, IN). Lanes 1-7: clones 8A . 1 , 8A.2, 8A.3, 
8A.4, 8A.5, 8A.6, 8A.11 (all isolated by screening with 
fragment 8A> ; lanes 8-10: clones 10B . 6 , 10B.29, 10B.41 
(isolated by screening with fragment 10B) . 

Figure 11 shows analysis of YAC DNA for presence of 

25 unit-length-linear fragments hybridizing to an ARG4 DNA 
probe: Lane 1: EcoNI digest of plasmid 

pl84DLARG/PCRF. 5 , which contains the 852 base pair PstI 
fragment from the human ADA locus cloned into the PstI 
site of pl84DLARG . 1 nanogram of digested plasmid DNA 
30 was loaded; Lanes 2-3: empty (no samples loaded); Lanes 
3-6: EcoNI digested YAC DNA (approximately 1 microgram) 



WO 93/03183 



PCT/US91/08679 



-15- 



from candidate transformants 184ADA.B, 184ADA.C, and 
184ADA.D. The elec trophoresed samples were transferred 
to a nylon membrane and hybridized to a 32-P labeled 
fragment of ARG4 DNA. The arrow indicates the position 

5 of EcoNI linearized plasmid pl84DLARG/PCRF . 5 (5.2 kb ) . 
Figure 12 is a schematic representation of one 
embodiment of the present homologous recombination 
method, in which a YAC containing target DNA is 
identified using recombination with a linear yeast 

10 plasmid. 



^i^^-5£l££i2tion_of_ATCC_De£osi ts 

The following deposits have been made at the 
American Type Culture Collection (June 28, 1990) under 
the accession numbers indicated. These deposits have 
15 been made under the terms of the Budapest Treaty and all 
restrictions upon their availability will be removed upo 
granting of a United States patent. 



1 * laccharomyces cerevisiae strain TD7-16d, ATCC 
No. 74010. 

20 2 - Plasmid pi 84DLARG , ATCC No. 40832. 

3. Plasmid pTKENDA, ATCC No. 40833. 



£etailed_Descri£tio^ 

The present invention is based upon Applicant's 
discovery that the process of homologous recombination 
25 which occurs in eukaryotic cells can be used for the 

purpose of screening DNA fragment libraries constructed 
in eukaryotic cells and identifying and isolating a DNA 
fragment of interest, referred to as a target DNA frag- 
ment. 
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The present invention is a method of isolating a DNA 
fragment of interest, referred to as a target DNA frag- 
ment, from a DNA library constructed in a eukaryotic host 
in which genetic recombination occurs by homologous 
recombination. The target DNA fragment is generally 
present in a larger fragment contained in the eukaryotic 
host cell. The DNA used to construct the DNA libraries 
may be cDNA or genomic DNA which is of human or other 
origin, including that of plants and other mammals. A 
target DNA fragment is identified by the present method 

bv 1 Tifrnriiifti Tver i".t-i ~ tvxt.a ^ , 

iv o ««« j-i-agmenc iiorary a non- 

replicating targeting vehicle which contains targeting 
DNA and an appropriate selectable marker gene and identi- 
fying eukaryotic host cells in which homologous recombi- 
nation occurs between target DNA and targeting DNA. 
Homologous recombination results in stable integration of 
targeting DNA and the selectable marker gene into DNA in 
host cells, which are identified on the basis of a 
selectable phenotype conferred as a result of stable 
transformation of host cells with the selectable marker 
gene. For example, they are identified on the basis of 
their ability to grow under conditions (e.g., in the 
presence of a drug or metal ion or in the absence of an 
essential nutrient) incompatible with growth of host 
cells in which stable integration has not occurred. 

The DNA library used in the present method is a 
population of eukaryotic host cells, such as yeast cells 
containing a unit, such as an artificial chromosome 
which includes a DNA fragment insert and is replicated in 
the host cells. The DNA library is screened for DNA 
fragment insert(s) , present in the artificial chromosome 
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all or a portion of which is a target DNA fragment, by 
introducing into the eukaryotic host cells a targeting 
vehicle, such as a bacterial plasmid, which is non- 
replicating in the eukaryotic host cells and includes a 

5 targeting DNA sequence (i.e., a DNA sequence homologous, 
at least in part, to the target DNA) and a selectable 
marker gene useful for selection in the host cell. Host 
cells containing the targeting vehicle are cultured under 
conditions appropriate for homologous recombination 

10 between the targeting DNA sequence and target DNA to 

occur" HoTt c~e~i~l~s S'fabTy transfor m ~e d — w i - f h the se~l"e-cira-b-l-e 
marker are subsequently identif ied . ( i . e . , by identifying 
host cells able to grow under conditions under which 
non-stably transformed cells cannot grow, and die). 

15 In general, the targeting vehicle is nonr ep 1 i c a t ing 

in the host cell, such as a bacterial plasmid, and 
includes the targeting DNA sequence and a selectable 
marker gene- for selection in the host cell. However, in 
certain embodiments, such as those in which the host cell 

20 is yeast, the targeting vehicle may be replicating 

vehicle, such as a yeast linear plasmid, which includes 
marker genes for selection in yeast and targeting DNA. 

In a specific embodiment of the present invention, 
which is exemplified by the Examples which follow, the 

25 DNA library is a population of yeast cells which contain 
artificial chromosomes carrying a DNA fragment insert and 
host cells containing target DNA are identified and 
isolated from this YAC vector library. 

A targeting vehicle, such as a bacterial plasmid, 

30 which is non- repl icat ing in yeast is introduced into the 
population of host yeast cells containing the DNA YAC 
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library. The bacterial plasmid includes a targeting DNA 
sequence which is homologous, at least in part, to target 
DNA. of interest and a selectable marker gene that 
functions in yeast. Preferably, the targeting plasmid is 

5 cut with a restriction endonuclease that introduces a 
double-strand break within the targeting DNA sequence, 
thereby linearizing the bacterial plasmid and providing 
DNA ends which are recombinogenic , to stimulate the 
process of homologous recombination with the YAC 

10 sequences. The efficiency of homologous recombination 

is,— a s— a_r-e.s-u.l-t-, in.cr.ea s.e.d.. Because the p lasmid is 

non-replicating in yeast, stable transformation with the 
selectable marker can only proceed by integration into 
natural or artificial yeast chromosomes. 

15 The resulting host yeast cell population, which 

includes stably transformed host yeast cells (i.e., those 
in which the plasmid, including the selectable marker 
gene, has been stably integrated by homologous recombi- 
nation into DNA already present in host cells prior to 

20 introduction of the targeting vehicle) and non-stably 
transformed host yeast cells, is cultured under con- 
ditions such that only stably transformed yeast cells are 
able to grow. In a correctly targeted event, the entire 
plasmid is stably incorporated in the host yeast cells by 

25 homologous recombination between the targeting DNA 

sequence of the plasmid and homologous sequences (i.e., 
target DNA fragments) in the YAC. In other embodiments, 
such as that in which a linear targeting molecule is 
used, it is not necessary, however, for the entire 

30 plasmid to become stably incorporated, as long as homo- 
logous recombination occurs to an extent sufficient to 
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on the far left is introduced into a population of yeast 
cells (ovals), each of which contains a DNA YAC con- 
taining a different DNA fragment. The plasmid includes a 
selectable marker gene for selection in yeast (diagonally 

5 lined section) and a targeting DNA fragment (solid sec- 
tion) in which a double strand break has been introduced. 
In this example, one host yeast cell (#3) contains a DNA 
fragment in a YAC that is homologous to a sequence 
carried on the targeting plasmid (solid sections on clone 

— 2Q— #-3->— PvCCGmb-i-nation— between— these — two— se quenc.e_o_c.au rs 

resulting in the stable integration of the selectable 
marker carried on the plasmid into the yeast chromosome 
(YAC) . The resulting population of cells is grown under 
conditions appropriate for selection of host yeast cells 

15 stably transformed with the selectable marker gene. For 
example, they are plated on appropriate selective media, 
such as nutrient deficient media. Only those cells in 
which the selectable marker gene functions grow. Growth 
of cells under these conditions is indicative of the 

20 presence of a target DNA fragment. Although YAC are 
exemplified herein, other yeast vectors, such as YCp 
vectors (YCp50, YCpl9) can be used to construct a DNA 
library . 



25 fragment from a DNA YAC library is shown in Figure 2. 
Figure 2 illustrates the integration of a targeting 
plasmid (pl84DLARG) carrying a selectable marker (the 
yeast ARG4 gene; open box) and a segment of DNA that is 
homologous to a sequence in the DNA YAC library 

30 (targeting DNA; solid arcs on plasmid). The thin lines 
represent an insert of human or other (non-yeast) DNA 



The general scheme for selection of a target DNA 
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propagated as a yeast artificial chromosome (YAC). The 
solid black box is the target DNA fragment, a sequence of 
DNA which is a portion of YAC DNA present in a DNA clone, 
found in the library, that is homologous to the targeting 
sequence. The remaining portions of the DNA YAC are 
comprised of the YAC vector arms: the thick lines 
represent plasmid vector sequences for replication and 
selection in bacteria. The shaded boxes represent 
genetic markers used for selection in yeast (yeast 
selectable markers URA3 and TRP1). The solid arrowheads 
and circle represent telomeres (TEL) and a centromere/ 
yeast replication origin (CEN/ARS ) , respectively. Figure 
2a depicts the targeting DNA (present in the targeting 
vector) aligning with the target DNA fragment in the YAC. 
Figure 2b depicts the product of homologous recombination 
between the targeting DNA and target DNA fragment. The 
targeting plasmid has been cut uniquely in the targeting 
DNA, at the site corresponding to the vertical arrow in 
the target sequence. ULL indicates the unit length 
linear restriction fragment that results from duplication 
of the target sequence (and the restriction site) on the 
YAC. As described in Example I, a ULL can be generated 
only if integration occurs into a DNA sequence that 
contains the restriction enzyme site in question and 
contains sufficient homology surrounding that site to 
allow resynthesis (by repair) of the restriction enzyme 
site on the targeting plasmid. Candidate clones that 
display a ULL are assumed to be homologous recombination 
events and are analyzed further. 

In another embodiment of this method, a yeast- 
selectable marker gene on the incoming targeting DNA 
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molecule can be a bacterial gene, engineered to be 
expressed in yeast, which confers drug resistance to 
yeast cells, e.g., the CAT or neo genes from Tn9 and 
Tn903, or bacterial amino acid or nucleoside prototrophy 
5 genes, e.g., the coli argH , trpC, and pyrF genes. 

In another embodiment of the method of the present 
invention, the targeting vector is a linear DNA fragment 
which includes a targeting DNA sequence homologous to a 
target DNA fragment to be identified and/or isolated from 
10 the YAC library. m this embodiment, a selectable marker 
__gen^i. s _i.n S .ex.t.e<i-i.n.to-t.h.e-t-a-r-ge-ti 



targeting DNA sequence which includes two non- contiguous 
domains. This embodiment is described in detail in 
Example II and represented schematically in Figure 3. 
15 The targeting vector, which is a linear sequence which 
does not replicate in yeast, is transformed into the 
pooled DNA YAC library, as described in Example I. 
Homologous recombination occurs between the targeting DNA 
and the target DNA fragment. 
20 In addition to the above-described embodiment, other 

approaches to introducing targeting DNA into host cells 
can be used. For example, targeting DNA can be present 
on a replicating yeast linear plasmid (Murray, A.W. and 
Szostak. J.W., Nature 305:189-193 (1983)) in a yeast 
25 strain of mating type opposite to that of the host strain 
used for the library. The linear plasmid has selectable 
markers flanking the targeting DNA sequence (i.e., one at 
each end of the targeting DNA) ; both markers are 
different from those used in the construction of the YAC 
30 lxbrary and can be selected against (i.e., negatively 
selectable markers, such as LYS2 , URA3 or CYH2 ) 
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Homologous recombination between two linear molecules 
produces two linear molecules, each of which is a hybrid 
of the two parental molecules. In this embodiment, in 
which recombination occurs between the targeting linear 

5 plasmid and a library YAC , each of the two recombination 
products is a YAC and each carries one of the two 
negatively selectable markers, allowing for differential 
selection of the two recombination products. 

The basis of this differential selection is illus- 

10 trated in Figure 12. Filled circles, arrowheads, and 
open rectangles represent centromeric , telomeric, and 
marker gene sequences, respectively. The shaded boxes 
represent targeting or target sequences. URA3 cells can 
be selected against (killed) by growth on media 

15 containing the nucleoside analog 5 - fluoro - orotic acid 
(5F0A), while LYS2 + cells can be selected against by 
growth on media containing the amino acid analog alpha- 
amino-adipic acid (aaa) . Molecule 1 is a target YAC 

constructed in a vector system using ARG4 and TRP1 as 

+ 4- R R 

20 selectable markers (phenotype arg trp 5F0A aaa ) . 

Molecule 2 is a linear targeting plasmid in which the 

targeting sequence is flanked by URA3 and LYS2 (phenotype 

• s s 

arg trp 5FOA aaa ) . The phenotype of cells harboring 

molecules 1 and 2 in an unrecombined form is arg + trp + 
o c 

25 5F0A aaa . Molecules 3 and 4 are the products of 

recombination between Molecules 1 and 2, resulting from a 

cross-over between the targeting and target sequence. 

+ - R S 

The phenotype of Molecule 3 is arg trp 5F0A aaa , and 

can be selected for by growth on 5FOA plates lacking 

+ S 

30 arginine . The phenotype of Molecule 4 is arg trp 5F0A 
p 

aaa , and can be selected for by growth on aaa plates 
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lacking tryptophan. Thus, cells containing one or both 
non-recombinant molecules, as well as cells containing 
either of the recombinant products can be differentially 
selected (cells harboring only one or the other recombi- 
5 nant product arise by random loss events). 

In such a scheme, the yeast cells harboring the 
targeting linear plasmids are mated to all members of the 
library and maintained under conditions favorable for 
spontaneous or induced homologous recombination (induced 

v.~_*«.t-_. x am.n.l.e,. m eiosis or ultraviolet irradiation). 

XU "J T — * _ , 

Recombinant target YACs are selected by virtue of the 

unique phenotypes of the recombination products resulting 

from homologous recombination between the targeting 

sequence on the linear plasmid and YAC molecules 

15 harboring a suitable target sequence. Each of the two 
product YACs is truncated at the position of the target 
DNA sequence, and the differential selection is used to 
isolate the two products separately. In order to isolate 
the two products of the single event, yeast cells 

20 harborings YACs and linear targeting plasmids are prefer- 
ably plated or gridded out prior to selection for 
recombinants. Selection is accomplished by replica 
plating onto the appropriate selective plates. 

In this embodiment, the relative orientation of the 

25 targeting sequence with respect to the two (negatively) 
selectable markers on the linear targeting plasmid is 
important. Recombination between a target YAC and only 
one of the two orientations of targeting linear plasmid 
will give rise to a stable recombinant (i.e., a recombi- 

30 nant with one and only one centromere). YAC molecules 

with two centromeres show frequent breakage and unstable 
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As an alternative to mating to introduce the plasmid 
described in the preceding paragraph, the plasmid can be 
introduced by transformation, essentially as described in 
Example I, followed by the induction step to free the 
5 targeting substrate from the yeast replicon. 

Identification and Isolation of a_Target_DNA_Fragment 
Using Homologous^Recombination 

The above-described embodiments of the present 
method are useful to identify and isolate any target DNA 

jq fragment, which can be an entire gene, a gene portion or 
other nucleotide sequence. For example, a gene of 
interest, such as a £-globin gene or adenosine deaminase 
gene, can be identified in a DNA fragment library using 
the claimed method and, if desired, isolated from host 

15 cells by known methods. Identification of target DNA 

fragments by the present method is described in detail in 
Examples I, V and VI. 

H£g*gJL°-.g Q g g^Re c c > mb inatio n_ Ch r o m o s o m e_ V a 1 k i ng 

The method of the present invention, by which a 

20 target DNA fragment is isolated from a DNA library, is 
useful for isolating physically - contiguous DNA segments 
from a DNA YAC library in order to construct a physical 
chromosome map. That is, when used iteratively, each 
time with targeting DNA derived from a YAC which overlaps 

25 with and extends beyond a previously identified region, 
it is a method for chromosome walking. In the present 
method of chromosome walking, a target DNA fragment 
present in a YAC is isolated, as described above. A 
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terminus of this first target YAC fragment is subcloned 
into a plasmid vector. The terminus of the first DNA 
fragment is, thus, used as a second targeting DNA 
sequence, which is introduced into host yeast cells 
containing a DNA YAC library. The terminus of the first 
DNA fragment, which is contiguous to the first target DNA 
sequence, in turn becomes the second targeting DNA 
sequence. As used herein, the term contiguous includes 
sequences which are immediately adjacent to the first 
target sequence and those nearby or in proximity to the 
-fir-s-t-t-a-r-get-seq-ueTic-e— (-i.e. , separated from the first 
target sequence by intervening nucleotide (s) ) . This 
second targeting DNA sequence should not have any 
homology with the first targeting DNA sequence, so that 
when it in turn is incorporated in a YAC at a point of 
homology with a second DNA clone, the second DNA clone 
selected will have a different terminal DNA sequence 
The terminal subfragment from the second DNA clone is 
used to isolate the next (i.e., the third) DNA clone 
Each successive DNA clone is isolated by virtue of its 
homology with the terminal subfragment of the previously 
isolated DNA clone. A series of overlapping clones is 
obtained by repeating this process; the process is 
repeated as needed to construct the physical map desired 
The successive recovery of terminal DNA fragments allows 
rescreening the same library or a second library for 
overlapping clones. 

In one embodiment of the present invention, chromo- 
some walking is carried out in order to determine the 
chromosomal location of a gene of interest, such as a 
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gene which causes a disease, by using a DNA fragment 
displaying a RFLP genetically linked to the gene of 
interest, or a fragment contiguous with the RFLP, as 
targeting DNA in the targeting vector. A targeting 
vector, such as a bacterial plasmid, which includes the 
RFLP-displaying DNA, or fragment contiguous to the RFLP 
displaying DNA, or cDNA as targeting DNA and a selectable 
marker gene, is introduced into a human DNA YAC library. 
Homologous recombination between the targeting DNA and a 
target DNA fragment in the library results in the first 
— s t e p— i n- w aik i ng- 1 o— t h e gene o f ~~ i nterest. A YAC con- 
taining the target DNA fragment is identified in this 
way. One terminus or both termini of the target DNA 
fragment is used as targeting DNA in a targeting vector 
to rescreen the same library or screen a second library, 
as described above. Also as described above, this is 
repeated, each time using a terminus of the target DNA 
fragment isolated in the previous step as targeting DNA. 
This continues until the gene of interest is identified 
or the desired physical map is completed. 

In another embodiment of the present method of 
homologous-recombination chromosome walking, the terminal 
fragments from the DNA YAC inserts can be isolated by a 
plasmid-rescue technique. This embodiment is described 
in detail in Example III and represented schematically in 
Figure 4. In this case, the YAC vectors are designed 
such that the YAC vector arm contiguous to the DNA 
fragment (clone) insert terminus contains sequences which 
allow for plasmid replication and selection in a 
bacterial host. Restriction enzyme digestion of the 
selected YAC DNA clone produces a fragment with one end 
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lying within the terminus of the DNA clone sequence and 
extending into the YAC vector arm. This fragment con- 
tains the bacterial plasmid sequences which are 
essential for replication and selection in coli , 

covalently linked to a fragment of DNA from the terminus 
of the selected YAC DNA clone. Plasmid rescue involves 
restriction enzyme digestion of the total yeast DNA from 
the selected yeast clone; ligation of the digested yeast 
DNA to form monomer circles; and transformation of this 
ligated DNA mixture into E^ coli, with the selection for 
the marker gene in coli. 

For use in conjunction with the plasmid rescue 
technique, one can design two different DNA YAC 
libraries. Each library will utilize a different pair of 
15 selectable markers. A set of four YAC arms are designed 
containing appropriate selectable markers for the two 
different libraries. Each YAC arm contains a yeast- 
selectable marker that would be appropriate for the 
selection of host yeast cells of the other library. In 
Figure 4, the yeas t - selectable markers in Library 1 are 
ARG4 and TRP1 and in Library 2 they are LEU2 and URA3 . 

Total yeast DNA from cells containing the first 
targeted DNA YAC clone are digested with a restriction 
endonuclease that separates the sequence conferring 
replication and stability function in yeast from the 
region of the YAC cloning vector that allows selection 
and propagation in bacteria and a selectable marker that 
functions in yeast (step 3 in Figure 4). This region 
remains covalently attached to sequences containing the 
30 first targeted DNA fragment terminus. This fragment of 
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the YAC DNA clone terminus contains sequences necessary 
for replication in bacteria, a selectable marker for 
selection in bacteria, and a selectable marker for 
selection, in yeast, along with the first targeted DNA 

5 clone terminus sequence. This fragment is circularized 
and amplified in bacteria (step 4 in Figure 4). This 
product then becomes the targeting plasmid with which to 
transform the second DNA library, after introducing a 
double-strand break within the sequence corresponding to 

in the DNA clone terminus (i.e., within the trageting DNA 
sequence) (steps 5 and 6 in Figure 4). The two DNA YAC 
libraries, Library 1 and Library 2, are constructed so 
that the arms in each are stabilized by a different 
vector sequence, with each arm having a unique selectable 

15 marker for selection in yeast and a unique selectable 
marker for selection in bacteria. 

The rescue of DNA clone termini described in Example 
III utilizers restriction endonucleases to cleave a DNA 
clone in such a manner that the terminus is covalently 

20 attached to a fragment of the YAC vector arm. One of 
ordinary skill in the art will know how to isolate DNA 
clone termini by use of various embodiments of the 
polymerase chain reaction (PCR) (for example, inverse PCR 
or anchored PCR) with such reaction using at least one 

25 unique primer that anneals to the YAC vector arm 

immediately adjacent to the DNA cloning site, such that 
the first strand synthesis proceeds away from the YAC 
vector arm and copies cloned DNA, and in which specific 
restriction enzyme cleavage sites comprise part of one or 

30 both of the PCR primers which would facilitate the 
subcloning of terminal fragments from DNA YACs. 
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Even in the absence of homologous recombination 
screening, a two library system is particularly useful 
for chromosome walking. Two key characteristics of such 
a two library system are that: 1) among the total of 
four arms which must be present in the final two 
libraries, no arm shares the same marker for plasmid 
selection in E_^ coli and 2) there is limited or no 
homology between the bacterial plasmid replicons used in 
the two different libraries. 

In this system, the two unique terminal se quences 

from clones isolated by plasmid rescue (see Example III) 
from the first YAC library (Library 1) can be isolated 
independently simply by plating on different selective 
media plates. Since the isolated plasmids harboring the 
terminal sequences have limited or no homology to either 
vector arm present in the second YAC library (Library 2), 
these plasmids can be used in traditional filter hybri- 
dization screening without subcloning the terminal 
sequences from the plasmid. The plasmids rescued in E^ 
coli can be purified and labeled (e.g., by nick- 
translation or random hexamer priming) , and used directly 
to screen a second library. YAC clones isolated from 
Library 2, themselves isolated by screening with intact 
rescued plasmids carrying terminal sequences from YAC 
clones isolated from Library 1, represent steps taken in 
a chromosome walk. Each walking step thus proceeds by 
using labeled plasmids derived from the ends of YAC 
molecules isolated from one of the two libraries to 
directly screen the other, complementary, library. This 
method greatly improves the efficiency of traditional 
filter screening techniques by providing a rapid method 
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for independent isolation of each of the two YAC termini 
by differential selection in forms suitable for direct 
labeling and library screening. It eliminates the need 
to subclone or otherwise purify terminal fragments for 
5 the purpose of labeling and screening for overlapping YAC 
clones . 

The design of the YAC vector arms and the restrict- 
ion enzymes used for plasmid rescue should be such that 
the yeast selectable marker (as well as the centromeric, 
10 telomeric, and yeast replication sequences) is separated 

f-r-o-m— the— r-eseued-- p±arsia£&—Bvqptt&ertt&—tl£e^l&C~cl&M 



terminus. This eliminates the need to use different 
yeast selectable markers in the construction of Libraries 
1 and 2, and to construct a host yeast strain with 
15 complete deletions of the selectable markers used to 
select for YAC clones in Libraries 1 and 2 . Unique 
selectable markers for each of the four arms, which make 
plasmid selection in coli possible, can be, for 
example, a gene encoding resistance to an antibiotic, 
20 such as chloramphenicol, kanamycin, ampicillin, tetra- 
cycline, spectinomycin, streptomycin, or erythromycin, or 
a gene encoding a biosynthetic marker for which a suit- 
able auxotrophic host exists. 

Bacterial replicons which can be used in order to 
25 limit the homology between those in the two libraries 

are, for example, P 15A, ColEl, phage M13, phage f 1 , phage 
Lambda and their equivalents. 

gost_Cell Types and Characteristics 

The method is described herein with particular 
30 reference to screening YAC DNA libraries constructed in 
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yeast cells through the use of targeting DNA sequences 
present in bacterial plasmids. It is to be understood, 
however, that this is merely for purposes of exempli- 
fication and that the present method can be carried out 

5 using other host cell types, provided that genetic re- 
combination between vector-borne DNA and DNA already 
present in the host cell occurs by homologous recom- 
bination and that an appropriate non - r ep 1 ic a t ing 
targeting vector is available. 

10 Appropriate eukaryotic host cells include those 

which normally (as they occur in nature) undergo genetic 
recombination essentially exclusively by homologous 
recombination (e.g., Saccharonijces c erevis ija e , 
i£!li2£saccharora^ce s 2£!5^£) • As used herein, the term 

15 essentially exclusively means that homologous recombi- 
nation occurs without significant levels of non- 
homologous recombination under the conditions used. 

Homologous - recombination selection of DNA clones 
could be utilized as a selection method in the cells of 

20 any organism in which 1) a suitable DNA cloning system 
exists and 2) the cells can be manipulated or induced by 
genetic engineering or genetic manipulation to perform 
recombination which is predominantly based on DNA 
sequence homology, or in which the targeting DNA can be 

25 treated in such a manner that it engages in homologous - 
recombination as its preferred mode of recombination. 
With these criteria met, one skilled in the recombinant 
DNA arts could perform homologous - recomb inat ion selection 
of DNA clones from a DNA library. Such organisms may 

30 include, but are not limited to, Schizo sac char omyces 
E£2l>£ » £££££E.ilil£ Eli^no^as t er , Homo sa£iens , Mus 
mus cuius and S£odo£tera f rugip er dea . 
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Saccharomyces cerevisiae is a preferred host 
organism for the selection of DNA clones using homo- 
logous-recombination because of its ability to route 
transforming DNA carrying double-strand breaks into a 
5 recombination pathway based virtually exclusively on DNA 
sequence homology. 

Certain characteristics of host cells in which DNA 
fragment libraries are constructed should be considered 
and possibly modified to optimize use of such cells in 
1 0 the present method , such as by decreasin g non-tar g eted 
events and, thus, increasing the efficiency of the 
method. For example, as described below, it might be 
necessary to remove selectable markers present in the 
targeting vector from host yeast cells and to construct 
15 targeting vectors in such a manner that they include no 
sequences homologous with those in the vector sequences 
used in the propagation of the DNA library. 

As described below, it has been determined that the 
selectable marker gene(s) chosen for the targeting vector 
20 should not normally be present in the host yeast genome 
or should be deleted from normal chromosomal position(s) 
in the host yeast strain. Without this modification of 
the host strain, recombination events between the select- 
able marker and the yeast genome would occur at a higher 
25 rate. For near - complete (> 99%) coverage of the human 
genome, a DNA YAC library with an average fragment size 
of 300 kb would consist of approximately 50,000 members 
(Maniatis, T. et al . , Molecular Cl onin g- A, Lab oratory^ 
Manual , pg 271 » Cold Spring Harbor Laboratory Press, Cold 
30 Spring Harbor, New York, 1982). In order to isolate 
sequences that are represented only once in such a 
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library, the ratio of targeted t-„ „ 
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....... ..d can be .ini.i,. d by a.o«.,f Y 

*" Sample VII, it Was 



25 



30 



determined that deleting th 7 * " 

selectable marker p chromosomal copies of 

desirabl K PreSent ° n the -ed is 

desirable because it reduces th* n ~ 

t ar «,of=^ the occurrence of non- 

vectors. present on targeting 

Non-targeted events mitrht 

».->..«... k .i.; "n\r ulc ° f 
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consistent with the Invention described in this appli- 
cation. It was possible to select yeast cells carrying 



10,000 of the cells transformed had such homology. In 
5 fact, at this dilution targeted events were isolated 
multiple (four) times, Indicating that a clone repre- 
sented once in a library of 40,000 clones could be iso- 
lated. 

— 10 Targeting vectors or vehicles useful in the method 

described herein are also the subject of the present 
Invention. One type of targeting vector of the present 
Invention has two key characteristics: the vectors are 
non-replicating in the host cell in which the DNA frag- 
15 ment library Is constructed and include a DNA sequence, 
referred to as targeting DNA, which is homologous at 
least in part to a target DNA fragment which, for the 
purposes of the invention, is a DNA fragment comprising 
all or a portion of a desired clone to be identified in 

20 and isolated from the DNA library. Targeting vectors 
will generally be bacterial plasmids of the Yip class, 
particularly in those cases in which yeast cell hosts are 
used. Vectors appropriate for other types of cell hosts 
can also be constructed using known techniques. 

25 Sequences used as targeting DNA in the targeting 

vector can be entirely homologous to the target DNA 
fragment, although they need not be. They need be only 
sufficiently homologous that under the conditions used, 
genetic recombination between vector-borne DNA introduced 

30 into the cells and DNA in YAC in the cells occurs by the 



homology to the targeting vector even when only 1 in 
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host cell recombination pathway or process. Preferably, 
a double strand break or gap is introduced into a tar- 
geting DNA sequence. The free ends adjacent to the break 
or gap can be modified to prevent recircularization 
(e.g., by phosphatase treatment of the ends of the DNA, 
by creating non- complementary ends by using two different 
restriction enzymes or by removing nucleotides from one 
strand of the DNA, producing a single stranded tail). A 
survey of the literature reveals that s ingle - s tranded 
(3') overhangs are intermediates in genetic recombination 
— in yeast and other species (Sun, H. et al^ Cell, 64:1155- 
1161 (1991); Maryon, E. and Carroll, D. Mol . Cell^ Biol^ 
11:3268-3277). It is reasonable to expect that the use 
of DNA modifying enzymes that degrade one strand of a DNA 
duplex (such as the strand with 5 '-3' polarity) on one or 
both sides of a double - strand break in this case, 
resulting in molecules with single stranded 3' overhangs 
on one or both sides of a double - s trend break or gap) may 
be useful in producing substrates that have enhanced 
ability to function as targeting molecules in homologous 
recombination library screening. 

In addition to targeting DNA, targeting vectors 
include a selectable marker gene that functions in yeast, 
an origin of replication and a selectable marker that 
functions in bacteria (e.g., E^ coli . ) . The selectable 
marker gene is one which is functional (makes selection 
of transformed cells possible) in the host cell type used 
for DNA fragment library construction. The choice of the 
yeast selectable marker gene can be made from among many 
various endogenous yeast gene loci, e.g., ARG4 , LEU 2 , 
HIS3. HIS4, THR1, URA3 , TRP1 , LYS2 , ADE2 , ADE8 , and MET2 
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Alternatively, the yeast selectable marker may be a 
marker gene that is not endogenous to the yeast genome, 
but is a foreign gene that confers a selectable pheno- 
type, e.g., a bacterial gene engineered to be expressed 
5 in yeast and confer drug resistance on the yeast cells 
(such as the CAT or neo genes from transposons Tn9 and 
Tn903, respectively) or nutrient prototrophy, such as 
amino acid or nucleoside prototrophy (such as E. coli 
argH, trpC, or pyrF genes). Other selectable marker 
10 genes useful for this purpose include genes which confer 
to-lera-ace— to— metral ions (e~g~ the CUP I gene , wh~ic~h 
confers resistance to copper ions), genes which confer an 
ability to progress through the cell cycle on cells with 
a mutant phenotype and genes which result in expression 
15 of a cell surface marker. 

The suitable selectable marker genes for selection 
in bacteria include the genes encoding resistance to the 
antibiotics , chloramphenicol , kanamycin , amp ic ill in, 
tetracycline , spec tinomycin , streptomycin, erythromycin, 
20 or any other marker, including genes encoding bio- 
synthetic enzymes for which auxotrophic bacterial hosts 
exist . 

Bacterial origins of replication may be derived from 
a variety of sources, including pl5A (exemplified by the 
25 origin of plasmid pACYC184) , ColEl, phage M13 , phage f 1 , 
phage Lambda, or any other replicon that one trained in 
the art would recognize as providing an equivalent 
function . 

Vectors constructed and used to screen YAC DNA 
30 libraries are described in detail in Example III and 
represented schematically in Figures 5 and 6a-6d. 
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Targeting plasmid p 184DLARG contains a selectable marker 
functional in yeast (ARGA) and a bacterial origin of 
replication (derived from pACYC184). 

Targeting DNA molecules are not limited to molecules 
of the Yip class. The targeting DNA can be a fragment of 
DNA purified from a larger plasmid, with such a plasmid 
constructed in such a manner that the desired targeting 
sequence is interrupted by, among other sequences, a 
bacterial or yeast replicon. The plasmid is also con- 
structed in such a manner that upon cleavage with a 
restriction enzyme that will release the replicon from 
the inner section of the targeting sequence, a yeast 
selectable marker remains covalently linked to the outer 
two ends of the targeting sequence. 

Alternatively, a selectable marker and a targeting 
sequence can be ligated together in vitro, and ligation 
products consisting of one copy of the targeting sequence 
and one copy of the selectable marker (or multimers 
consisting of alternating targeting and selectable marker 
sequences in a uniform orientation) are purified. These 
ligation products are circularized in vitro and cleaved 
with a restriction enzyme to introduce a doub le - s trand 
break of gap in the targeting sequence and leaving the 
selectable marker intact. 

Finally, the two halves of a targeting sequence can 
be ligated to a selectable marker in a single three-way 
ligation in vitro to generate a targeting molecule 
suitable for transformation. 
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Yeast A rm Vectors 

Yeast arm vectors or YAC arm vectors, which are used 
to produce yeast artificial chromosomes, are also the 
subject of the present invention. YAC arm vectors 

5 include a yeast selectable marker gene, a bacterial 

origin of replication, a bacterial selectable marker gene 
and a yeast telomere. They may additionally include a 
yeast replication origin (ARS) and/or a yeast centromere 
sequences. The components of these YAC arm vectors can 
be obtained from sources in which they occur naturally or 
can be produced using recombinant or genetic engineering 
techniques or chemical synthesis. For example, the 
telomere sequences, centromere sequences and ARS can be 
obtained from yeast or from another organism. It is only 

15 necessary that they function in yeast host cells as, 
respectively, a telomere, a centromere or an ARS. 
Components which have equivalent functions, regardless of 
their source (e.g., yeast or other source) are referred 
to herein as functional equivalents of the corresponding 
yeast element. 

The present invention is illustrated by the 
following Examples, which are not intended to be limiting 
in any way . 

Methods Used Herein 

25 Unless otherwise noted, methods for plasmid purifi- 

cation, restriction enzyme digestion of plasmid DNA and 
gel electrophoresis, use of DNA modifying enzymes, 
ligation, transformation of bacteria, transformation of 
yeast by the lithium acetate method, preparation and 

30 Southern blot analysis of yeast DNA, tetrad analysis of 
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yeast, preparation of li quid and solid ^ ^ 

^ ££li and yeast, and all standard molecular bio- 
logical and microbiological techniques can be carried out 
essentially as described in Ausubel et al . (Ausubel p M 

Pubflini— r-^^-^^^USLlosi, Greene" ' 
Publi shlng Associates and Wiley-I nterscience> New york> 

SAMPLE.! SELECTI0N_BY_H0M0L0G0US^REC0M^ 0F 

A_TARGETED_DNA_CLONE_FROM_A_DNA_YAC 

.LIBRARY- ~ 



lib J <iICC " 7380) " aS »" 4 '..« »»"r»a . 

til. I .I""" 8en<> ° IC DBA - " U "" "»*'-" isolated 
'«•«..«• c.U. (D . Burk ., Ph.D. The sis , 

with EcoRl and BamHI . digested 

yeast T ho e s t ligati0n mlXtUre then US6d C ° tr.».f.„ 

yeast host strains, either MGD131-10c or iv-lfid 

20 the spheroplast method (Burgers P „ j I " ' ^ 
K J mor,n * , 8 P - M -J- and Percival, 

(198?) ^ica U ic £te£2 163:391-397) (The 
construction of host strains MGD131-10c and IV-l 6d wi « 
the appropriate marker deletions is described - . 
»« oelo W .> Since the pYACA vector eerrle ^"y 
25 selectable markers TRP1 and URA3 , transf ormants can be 

" i:r i f r 6 :i Y e ;r th on piates - 

uracil. U,625 YACs with an average size of l90 kb (0 73 
hun,an genon,e equivalents) are individually grown i w 
wells of ua ii * iy grown in the 

30 ...» r^r^: 1 ™-:.;; 1 - — - 
-oo .,.„.. .... Fcr .. ch . uhP :. 1 :-.r.::::t::;: i : f 
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30% glycerol was added and the subpool was aliquoted and 
frozen at -70°C. 

For a library comprising 73% of one genome, and 
assuming equal representation of all clones, the proba- 

5 bility that it contains any one specific human DNA 

sequence is just over 0.5. The probability that one of 
six different fragments of DNA is represented in the 
library is l-(0.5) 6 , or 0.98. 

The construction of the targeting plasmid pl84DLARG 

IQ — is— described— be-l-ew— and- i-l-l-us t rated— in— Figure— 5-. It 

carries the yeast ARG4 gene (Beacham, I.R. et al. (1984) 
Gene 2£:271-279) as a selectable marker, and its 
bacterial origin of replication is derived from pACYC184 
(Chang, A.C.Y. and Cohen, S.N. (1978) Journal_of 

15 Bacteriology 134:1141-115 6.), which shares only limited 
sequence homology to the pBR322 origin used on pYAC4 . 
The entire chromosomal copy (a 2.0 kb Hpal DNA fragment) 
of ARG4 has been deleted in the library host strains 
IV-16d and MGD131-10c. The 2.2 kb Bcll-Clal fragment 

20 from pACYC184 (Chang, A.C.Y. and Cohen, S.N., (1978), 
Journal of Bacteriology 134: 1141-1156 . ) containing the 
pl5A origin of replication and the chloramphenicol 
resistance gene was ligated to BamHI-AccI digested pMLC28 
(a derivative of pSDC12 carrying the pUC18 multiple 

25 cloning site; Levinson et al. , j^.Mply AppL^Gen ; , 

2:507-517 (1984); plasmid pUC18 (ATCC #37253) can sub- 
stitute for pMLC28 in the construction of pl84DLARG 
described here) . BamHI and AccI cut this plasmid one 
time each, in the polylinker. The ligation mixture was 

30 digested with SacI and Hindlll , which cut in the PMLC28 
polylinker, and the digested DNA was treated with T4 DNA 
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*'•» (to »».„.,. .„, p , r . ncal molecul(is) 
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. oiai £ " sm " es •* - • 

f"d. H....d pHp.5 (provided by N . Schultes .„d j 

^ D """—" " Mol.eul.r Blol H....o hu .. tt . 

O.neral Hospital . »„...„ "" 

. the ARG4 

gene as a 2.0 Kb Hpal f ragment inserted 

site of P MCL12 (a derivative of p SDC1? . 

-itipi. clonlng site) . Levin ; o ? c - pnc» 

2:S0 7 - 517 (1984) . This pa.-^.'^i^ 
ana S.a! sices f lanking the ARG4 ; ^ "« 

carrying a single copy of the ARG4 gene inserted i w 
orientation shown in Figure 5 was . " "'^ ln the 

P184DLARG. Figure 5 * "** «* designated 

Gfinomi ' S 5 1S 3 nap 0f P 1 "™** P184DLARG. 

r :itai r i a6 :: nts for tyrosine cc*.... 

xx; , nietallothionein tt j 
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Hpal-BamHI fragment from the 5' end of the bete globin 
gene (chromosome 11) was blunt-end ligeted to the same 
2.2 kb Bcll-Clal fragment used to construct pl84DLARG . 
The beta- and eps ilon- globin fragments are 1.3 and 1.9 kb 
fragments, respectively, f rom the human beta-hemoglobin 
locus on chromosome 11. The beta-globin fragment (ATCC 
#39698) was subcloned from pHUS'beta (Treco, D . et al 
M£^-C£ll^Bi£l^, 5:2029-2038, 1985), and includes 
sequences from positions 61.338 (Hpal site) through 
62,631 (BamHI site) in the Genbank HUMHBB sequence This 

-_f.r.agme^inc.l^ beta-globin 

gene. The Avail site at Genbank map position 62,447 was 
used to introduce a double-strand break for targeting 
leaving 1.1 and 0.18 kb of homology on either side of "the 
break. The 5' ep S il on - g i obin probe (ATCC #59157) is a 
HindHI fragment and includes sequences centered ' 
appro*** ately 15 kb 5 • to the epsilon-globin gene (ATCC 

HUMHRR ' P ° Sitions 3 ' 2 " trough 5,172 in the Genbank 

HUMHBB sequence. The Apal sites at map positions 4,361 
and 4.624 were used to create a 0.26 kb double-strand gap 
for targeting, leaving 1.1 and 0 .5 kb of homology on 
either side of the gap. 

Properties of the remaining four genomic DNA frag- 
ments are as follows: tyrosine hydroxlase (chromosome 
XI. 2.3 kb BamHI fragment; ATCC #59475; double-strand 
break made with HindHI, 0 . 6 kb from end); me tal- 
lothionein pseudogene (chromosome 4; 2.8 kb HindIII- Ec oRI 
fragment; ATCC #57117; double-strand break made with 
Ndel, 0.4 kb from end); anonymous DNA marker D16S3 
(chromosome 16; 1.5 kb HindHI fragment; ATCC #5 9447- 
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D16S37 (chromosome 16; 2 . 3 k b Hi „dIII £r .„. nt . „„ 

.. ub i.-.„. na br .. k „ d . vlth Ap j O .; 5 \ r b oc £rom 

E.ch t.r,.tlng pl.,»lo ... i ln . ari2ed „ lth . r> _ 
•tri.tl.. .„*,„. th « cuts wlchln hum>n 

««..«„. DM, ... „ „ o £ „„.«.« DNA „a s .... * o 
tr.n. fot „ he pooled llbt , ry ^ uai ^ 

ll»..r, subpools vete ^ ^ 

10 CM -»r. . txp aediu „ „„ tai „ liig „ ^ each 

..Plollll.. This SHto .^^, 

*.»!«, .f M 10 7 „ lls/ml « 

o.i,.. 1Ithlu „ ... t „. „ ethoa »«• 

' treene Polishing Associates and Wilev.I n t B r • 
New York, 1987) 20 u „ i Wll «yInter S cience, 

-l*o/,. 20 pg of plasmid within 

W " »"* to transform 7 x 10 « cells " \ , 
0 2 ml «^ . i ceils in a volume of 

20 - ~" : f »=: r ;::::: tvr — - 

-edia lacking uracil tmtMh (complete minimal 

5 uracil, tryptophan, and areinine} = 
incubated at 30'C for 3-7 days. einin *> and 

Transformants were analyzed by restrict-,- 
digestion and Southern hybridization enZy, " e 

» Prepared from each of th candid . ^ *"* 

candidates and digested with 
the same enzyme used to linearize the tar. I 
The Southern blot was probed with £, TAllllV^' 
-A. Homologous integration events re £ ^ 

ybridi zation t0 a single band of exactly the Le 
Length 'r 6 " 12 '* *•»!»« DNA aolecule [the ..^ 

Length Linear- band (ULL) ; Figure 2] A ULL 

& * J . A ULL can only be 
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generated if integration occurs into a DNA sequence that 
contains the restriction enzyme site in question, and 
contains enough homology surrounding that site to allow 
the re-synthesis (by repair) of the restriction enzyme 
site on the targeting plasmid. Candidates that display a 
ULL are assumed to be homologous integration events and 
are subjected to further analysis. Unit length linears 
were seen for 6 of 21 epsilon-globin candidates analyzed 
and for 3 of 14 beta-globin candidates. No unit- length 
linears were observed in candidate clones isolated with 
any-of-th-e— other targeting - f ragments used. 



Figure 7 is a restriction enzyme and Southern blot 
analysis of clones selected by targeting with human 
epsilon- and beta-globin sequences. In the left panel, 
DNA from nine clones selected as arg+ were digested with 
Avail (the enzyme used to make the double -strand break in 
the beta-globin targeting sequence). In the right panel, 
DNA from nine clones selected as arg+ were digested with' 
Apal (the enzyme used to make the double - strand break in 
the epsilon-globin targeting sequence). The asterisks 
identify clones correctly selected by homologous recombi- 
nation. The lanes marked M were loaded with purified 
beta-globin targeting plasmid digested with Avail (left 
panel), or purified epsilon-globin targeting plasmid 
25 digested with Apal (right panel). The size of this 

marker fragment is identical to the size predicted for 
correctly targeted events. The arrowheads indicate the 
fragment size predicted for correctly targeted events, 
5-6 kb in the left panel and 6.2 kb in the right panel. 
30 Hybridization was with 32 P labeled ARG4 DNA. 
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.«!« probin8 wlt „ , p , Uoii ^ „„. 

« .„r.,rl.„. Ihla a „ alyai . 0 „ 0 „ tt . t . d that 8 " 
VAC. „. .„„ c „ rjr b#t> ^ 

ii.bi. m, .s „„ ld „. axp „ ted slnc> * 

1» o,ly 40 Kb .p. rt on human „„„„,„,,. «"•* 
■0 .AC. ti . ARG4 DHA b aa „„„ . „ ,/ , ' J 

the P184DL ARG cor,,,-,. . .. y " kb and 

into the rv" ince * rated « predicted 



A "^B^acea as prec 

into the homologous DNA within the g lo bin locus 

Homologous recombination has been successfully used 
to isolate uni q ue genes from . DNA YAC library. Te s 

:: r:: rrr r entire beta - giobin io <- *~ 

globin \ 6PSil0n g6ne d ° Wn to beta 

Slobin gene, along wlth about kb . f 

. ;r:;r:.:::ir:-;:." : 

20 from the ^-globin locus after the hk . 

at -70-r * library had been stored 

:::::VT . b_ CBA vAr:: b :: t r b r:::";::.:" ia " 

recombination selection. 

USING_0NE^STEP_GENE_DISRUPT10N 
The method of one-sten ... ~ 

R t « v P 6 disruption (Rothsteir, 

R.J-. Methods_in_Enz 2 molo £ y, 101-202 211 . ' othst ^- 
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of clones from DNA libraries by homologous recombination. 
In this embodiment, a selectable marker is inserted into 
the targeting sequence. The targeting sequence, with the 
embedded selectable marker, is subsequently isolated as a 
single linear fragment (as diagrammed in Figure 3) and 
transformed into the pooled DNA YAC library, as described 
in Example I. Correctly targeted clones arising as a 
result of homologous recombination between the targeting 
molecule and specific DNA clones within the library will 
carry a single copy of the targeting sequence that is 
— disrupt-ea-b-y-^the presence of the selectable marker, and 
will migrate at a specific and predictable position after 
restriction enzyme digestion and Southern blot analysis, 
using either ARG4 or the targeting sequence as a radio- 
labeled probe. This is in contrast to the process 
described in Example I, in which the correctly targeted 
DNA clones have two uninterrupted copies of the targeting 
sequence flanking the selectable marker. 

Figure 3 illustrates the selection by homologous 
recombination of a DNA clone from a DNA YAC library using 
one-step gene disruption. The thin line represents an 
insert of DNA in the form of a yeast artificial chromo- 
some (YAC). The solid box is the DNA fragment, a 
sequence of DNA constituting a portion of a DNA YAC clone 
found in the library that is homologous to the targeting 
sequence. In the diagram, the targeting sequence (solid 
boxes) has been modified by the insertion of the yeast 
ARG4 gene (open box). The remaining portions of the DNA 
YAC are comprised of the YAC vector arms: the thick 
lines represent plasmid sequences for replication and 
selection in bacteria. The shaded boxes represent 
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genetic markers used for selection in yeast (yeast 
selectable markers URA3 and TRP1) . The solid arrowheads 
and circle represent telomeres (TEL) and a centromere/ 
yeast replication origin (CEN/ARS), respectively. Figure 
3a depicts the targeting molecule aligning with the 
target sequence on the DNA YAC . Figure 3b depicts the 
product of homologous recombination between the targeting 
and target sequences, with the targeting sequence having 
replaced the target sequence. 

As a specific example of this embodiment of the 
basic conce pt, t-.hc 
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fragment (see Example I) is subcloned into the HindHI 
site of pUC18 (ATCC *37253). The resulting pl asmia is 
digested with Apal, drop P i ng out a 0.26 kb ApaI fragnent 
from the central portion of the 5' epsilon- gl obin insert. 

3 Apal overhangs are made blunt with T4 DNA poly- 
merase, and the resulting material is li gated to th / 
purified ARC4 2.0 kb Hpal fragment (Beacham, l. R Gene 
12: 2 71-1 79 , l9i4> . The resulting piasmid> with ^ to. 

with Hmdlll and transformed into the DNA VAC l ibrary as 

described in Example 1. The specific examnl. 

results in the replacement of o'., » ^ 

.-bin DNA with the ARG4 sequence, since 1 

unique in the targeting sequence. For enzymes that are 

;;; U ; ^ '""^ ™«« however, the result 

will be a simple insertion. 
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~ AMPLE_III S 0 OLOG 0US_- RE C OMB I N A TIP WCHRflMnsftM v 
HALKING_UTILI2ING_TW0_YAC LIBRARIES 
Construction of Yeast_Artlf icia^ChrJ^Iome ( YAC ) 
Libraries 

A-l) Saccharomyces Cerevisiae 

SglS- Strain Construction 
The construction of a strain of S. cerevisiae 
carrying chromosomal deletions of each of the four 
genes used as selectable markers on the four YAC 
vectors described can be carried out a s follow- 



A . 1 . a ) Deletion_of_ARG4j_ 

The internal 2.0 kb Hpal fragment carrying the 
entire structural gene and regulatory elements for 
the yeast argininosuccinate lyase gene (ARG4) is 
deleted from a plasmid consisting of the 11 kb BamHI 
fragment isolated from p(SP013)2 (Wang, H-T et 
£1 Molecular and_Cellular_Biolog^ 7:1425-1435, 

-37 J int ° BamHI ° f PUC19 < ATCC 

-37254). by digestion with Hpal and relegation of 

the DMA under dilute conditions (1 „ g/al) . The 
resulting plasmid is digested with BamHI and intro- 
duced into an S. cerevisiae strain carrying the 
wild- type alleles for ARG4 , TRP1, URA3 . and LEU2 
and carrying any non-reverting hisS' allele The 
transformation is carried out i n conjunction with 
any plasmid carrying yeast CEN and ARS elements, and 
the yeast HIS3 gene, using standard co-trans- 
formation conditions (Ausubel. P.M. et al . . Current 

Publ.sh.ng Associates and Wiley- Interscience . New 
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readily be constructed by subcloning the 1.7 kb 
BamHI fragment from pRB15 (ATCC #37062) into the 
BamHI site of YCp50 (ATCC #37419). His + cells are 
screened for arginine auxotrophy by replica plating 

5 onto CM -arginine plates. His* arg" cells are grown 

in the absence of selection for HIS3, and single 
colonies are isolated and screened for histidine 
auxotrophs . DNA from his" arg" colonies is prepared 
and analyzed by restriction enzyme and Southern blot 

10 analysis to identify t r ans f ormant s carrying the ARG4 

deletion ( arg4~A~)~ Thi s protocol is used to generate 
strain MGD131-10c used in Example 1 above. 
A . 1 . b ) Deletion_of_TRPl2 

In a yeast strain of opposite mating type as 

15 that used above, also carrying mutant alleles for 

LEU2 and URA3 (leu2* t ura3"), an identical procedure 
is carried out, but using a linear fragment of DNA 
carrying a deletion of the yeast gene for N-(5'- 
phosphor ibosy 1 ) - anthranilate isomerase (TRP1) . 

20 This is accomplished by subcloning the BamHI-XhoI 

fragment from pBR322 - Sc4120 (Stinchcomb, D.T., et 
a 1 . , Journal^ of _Molecular_B i olo^y , 15 8 : 1 5 7 - 1 7 9 , 
1982) into BamHI-XhoI cut pGEM7 , (Promega, Madison, 
Wisconsin) followed by deletion of the 1.2 kb EcoRI 

25 fragment containing TRP1 and ARS1. The resulting 

plasmid, pK2 , is digested with BamHI and Xhol and 
co- transformed with a HIS3-CEN-ARS plasmid, like 
that described in A.l.a) above, selecting for 
histidine prototrophs , and following the strategy 

30 outlined in A.l.a.) above to identify cells carrying 

the TRP1 deletion (trplA). These cells are mated 
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with cells carrying arg4A, and diploids heterozygous 
for the two deletions are isolated. This strain, 
TD7-16d, is sporulated, subjected to tetrad 
analysis, and spores with appropriate phenotypes are 
5 analyzed by restriction enzyme and Southern blot 

analysis to identify a strain with both the arg4A 
and trplA alleles (IV-16d used in Example I above). 
The genotype of TD7-16d is: a/a, arg4A /ARG4 , 

LEU2/leu2-3, 112, ura3 - 5 2/URA3 , trp 1 - 2 8 9/trp lA , 

j. __/"»_ / .3 /i . t _S_ , , _r - . 

xu luc^-iui/auei-iui, cyn /cyn , {uxttz/ cytx'Z ) , 

his3Al/his3Al 

A . 1 . c ) Deletion of LEU2 and_URA3j^ 

Strain TD7-16d is used as the recipient in 
additional co - transformation experiments, first with 
15 a linear DNA fragment carrying an internal deletion 

of the 1.3 kb HincII-AccI fragment corresponding to 
the yeast 0 - isopropy lmalate dehydrogenase gene 
(LEU2), and subsequently with a linear fragment 
carrying an internal deletion of the 0.85 kb 
20 Pstl-Nsil fragment corresponding to the yeast 

orotidine-5 ' -phosphate decarboxylase gene ( URA3 ) . 
The plasmids YEpl3 (ATCC #37115; Broach, J.R., et 

Gene, 8:121, 1979) and YIp30 (ATCC #37109; 
Botstein, D. t et al. Gene, 8:17- 24, 1979) are used 
25 as sources for constructing deletion derivatives of 

the LEU2 and URA3 genes, respectively. A diploid 
that is heterozygous for all four deletions is 
sporulated, subject to tetrad analysis, and screened 
for haploid colonies that have the minimal genotype 
30 MATa arg4A trplA leu2A ura3A. This is the recipient 

strain for constructing Libraries 1 and 2. (See 
Figure 4.) 
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A . 2 ) C ons t r uc t i on_o f _Ye a s t_Ar t i f i c i a 1 
P^£21£some_XXAC2_Vectors2 
The construction of an artificial chromosome 
requires that sequences capable of stabilizing the 
5 ends of linear DNA molecules (telomeres or TEL 

elements) be ligated to each end of the DNA chosen 
for cloning. In addition, each end needs to carry: 
1) a yeast gene that can be used for genetic 
selection in the initial construction of the library 

10 and for subsequent use as a selectable marke r for 

use in selecting clones out of a library by homo- 
logous recombinat ion, and 2) sequences that allow 
replication in coli and confer antibiotic resis- 

tance in E. coli (selectable markers). Each end 
15 should also carry a sequence that functions as an 

initiation site for DNA replication (an ARS 
element). Finally, one and only one, of the two 
ends must carry a sequence that functions as a 
centromere in yeast (a CEN element). 
20 To ensure that each linear DNA molecule trans- 

formed into yeast has two different ends (only one 
of which caries a CEN element), to facilitate the 
identification and recovery of each end uniquely, 
and to generate the two YAC libraries (Library 1 and 
Library 2), a total of four different ends are 
needed, utilizing four different yeast genes and 
four different antibiotic resistance markers. All 
of the various elements described above are ligated 
together in specific arrangements to generate yeast 
artificial chromosome vectors which can be propa- 
gated and manipulated in E. coli. To minimize the 
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possibility of homologous recombination between the 
ends of artificial chromosomes in Library 2 and 
targeting plasmids isolated from Library 1, the 
bacterial origins of replication on the vectors used 
in the construction of each individual library are 
from different sources. So that the final vectors 
are compact, easy to manipulate, and unlikely to 
rearrange by virtue of the duplicated bacterial 
origins of replication, each of the four ends is 
maintained as a different plasmid in bacteria, in 
" contrast to the invention described in U.S. Patent 
No. 4,889,806. 

A . 2 . a) Cons true tion_of_a_CEN^ARS_Element 

The PstI site of P UC19 (ATCC #37254) is removed 
15 by blunting with T4 DNA polymerase and recirculari- 

zation with T4 DNA ligase. The resulting plasmid 
(pCU19/Pst- is cut with EcoRI and Smal and the 3.1 
kb EcoRI-Smal fragment from A75p9 (carries ARS1, 
TRP1, and CEN3 ; Murray, A.W. and Szostak. J.W.,' 
Nature. 305:189-193, 1983) is inserted. The 
resulting plasmid (pTIOH) is cut with StuI and 
BamHI, removing the TRP1 gene and all CEN3 
sequences. The StuI-BamHI fragment carrying the 
P UC19/Pst- backbone and ARS1 is gel purified and 
ligated to a 382 bp Sau3A-ScaI fragment carrying 
CEN3 isolated from A75p9 (Murray, A.W. and Szostak, 
J.W., Nature, 305:189-193. 1983). The resulting 
plasmid ( P T12H) carries ARS1 sequences from 
positions 829-1453 in the published TRP1 sequence 
(Tschumper G. and J. Carbon, Gene, 10:157-166, 1980) 
fused to CEN3 sequences 1-382 (Bloom, K.S. and J. 
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Carbon. Cell, 29:305-317, 1982), with both fragments 
inserted between the EcoRI and BamHI sites of the 
pUC19/Pst" polylinker. 

A . 2 . b ) Construct i o n_ o f _ a_ Y A C_ ARM VECTOR 

pTKENDAg 

The Sau96 site of pMLC28 (pSDC12 with pUC19 
polylinker; Levinson, A., et al. J^._Mo 1^_A£ E 1^_G en. , 
2:507-517, 1984) is removed by blunting with T4 DNA 
polymerase and recircularization with T4 DNA ligase. 
The resulting plasmid ( P MLC2 8 /Sau ' ^ j_ s rfu.^.j vith 
EcoRI and BamHI, and annealed with oligonucleotides 
1 and 2 (Figure 8a), and treated sequentially with 
T4 DNA ligase, T4 DNA polymerase, and T4 DNA ligase. 
The treated molecules are transformed into E. c o 1 i , 
and chloramphenicol resistant trans formants are 
screened for the presence of an Apal site expected 
to be found in recombinant plasmids carrying the 
oligonucleotides. Plasmids which also regenerate 
the EcoRI and BamHI sites are subjected to dideoxy 
DNA sequence analysis. One plasmid with the correct 
sequence (pMLC28/SL) is digested with EcoRI, blunted 
with T4 DNA polymerase, and ligated to the 2.0 kb 
Hpal fragment carrying the yeast ARG4 gene. 
(Beacham, I.E., et aL, Gene, 29:271-279, 1984). 
The resulting plasmid with a single insert of the 
Hpal fragment (pT20) is cut with BamHI and Hindlll. 
and mixed with a purified 0.7 kb BamHI. EcoRI TEL 
fragment and the 1.0 kb EcoRI -Hindlll fragment con- 
taining ARS1 and CEN3 from P T12H ( Sec ti onA . 2 . a . ) . 
Transformants resulting from this three way ligation 
are screened by restriction enzyme analysis. The 
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ZhT it <pi2i> i= - ith «* 

..rxv.a ftom ^ ^ ^ • 
...ftd pTKEHDA . lllu . tt , te , ^ 

«P of pTKENDA, .tth r.l„„t f.tur.s .„„ 

L.I T H """ II: X: R: «••"; Xb: 

" Se " e; C » : «M««Ph«io.l tBlm „, 
ORI(pMLC28): pMLC28 nv*„< ^ ' 
ARS.L- J, 8 ° rtSln ° f "Pliotl,,; CEN3 , 

.centromere) and ARS1 (replication 



- . 7 & " u i (replic 

origin), respectively* tpt • « 

telflraAra . y ' TEL ' se ^ uence that seeds 

telomere formation in yeast- ev R. 

y r * exR - former EcoRI 

££ii- The arrow indicated *~ 
ARG4 transcription. Erection of 

The CEN3 -ARS1 element used in p TREND A is not 
Che preferred sequence to u «„ * 

YAC libraries To constructing DNA 

„, f " ^ T ° conv «t pTKENDA to the more 
Preferred derivative, pTKENDA i, a- 

- ...... ««, kLl: r ;::r:;* E d vi \ h xbai 

cut with R^mUT j UJNA ls then 

Xth BamHI » ^oPPing out the CEN3 -ARS1 •! 

.« ..... fc „ PI12B :zt 

tne TEL sequence. The 6 5 kfc 

L. coll dha , tu£f . t '"""""> "»)"«! APG4, 

Ph.mc.x~ t « 8 :;/ 1 r ; e "r -* •««-- 

e gene ls gel purified 
Separately. pTKENDA is digested with HindT T t 
30 BamHI and the 0.7 kb TEL I, Hindlll and 

b in this 1 fi ::r- (ref : rred to as 

oaincation) is gel purified 
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Hi „„ r (ATcc #37364) is digested 

Hmdlll, Pvull, and Xbal and the 2 . 6 kb Hindlll- 
PvuII fragment carrying CEN4 and ARS1 is gel 
Purified (referred to as fragment C in this modifi. 
cation). Fragments A, B, and C are ligated 
together, transformed into E . c o 1 i . and chloram- 
phenicol resistant colonies are screened for 
P1...I-. with a single copy of f rag ments A , B. and 
C The resulting plasmid is pTKENDA2 . 
A • 2 . c ) cons true £ i on_of _a_YAC_ARM_VECTOR 



_£tKENDB- 



-37060,. cerr y i„ g ch. y ... t „„ 

»>ch 14 DM poller... . n4 „ Hlnc „ 1U "" d 

pOClc (AICC .37254) On. .7 .„ MlncI1 
. , 0ne Plasmid, pT32H. i. 

Isolated in „hieh the direction „f . 

Che TPP1 . ■ ° f tr «»stciptien of 

the IRP1 g.»e x . directed .». y £r<1 „ the EcoPI eit . 
of the p„c lS po lyl i„ k ,r. Ihls plasmid ls 1 * « 
EcoPi and ,„ HI . „„„ led ou6os 3 « 

«.>, a „d treated ,.,„„ti.l ly „ lth T4 „ M 
Ui.ee, T4 MA pe„.. t .„, .„„ „ »" 

treated .ol.onX.e ere tr.„ s£ or..d into Ecu end 

":;:^^tidt;"„7::;" a the — 

„. sequence analysis. One 

pl.e.id . th the correct ..,„.„„ J" 

purified for further use. 

Plasmid PBS/+ (Stratagene Cloning Sys tems 
-0Ua, CA> is cut With AatXl and EcoVand M unted 
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with T4 DNA polymerase to delete the LacZ gene. The 
resulting molecules are circularized with T4 DNA 
ligase and ampicillin- resistant E. coli trans- 
formants are analyzed for the correct deletion 
derivative which regenerates the EcoRI site. One 
plasmid (pBSA) is cut with EcoRI and PstI (both of 
which cut within the pBS/+ polylinker) , and ligated 
to the 0.85 kb TRP1 EcoRI-PstI fragment from pT32LH. 
Ampicillin-resistant transf ormants from this 
ligation are screened by restriction enzyme analysis 

f.o.r_moXec.ules-w-i-feh-^ 

PT32BH is then cut with BamHI-XhoI TEL fragment from 
pTKENDA, and transf ormants are screened by 
restriction (Section A.2.b.) enzyme analysis for 
molecules with a single insert of the TEL fragment. 
This plasmid, pT33H. is cut with SphI , blunted by 
treatment with T4 DNA polymerase and ^circularized 
with 14 DNA ligaae. The resulting plasmid is pT34H 
PT34H is digested with SnaBI and BamHI, and ligated 
to the 1.2 kb SnaBI-BamHl fragment from plasmid 
pBRr^a (ATCC #39698). The resulting plasmid is 
designated pTKENDB . Figure 6b is a plasmid map of 
PTKENDB, with relevant features and restriction 
enzyme recognition sites: N: Nail; A : Apal ; Sn - 
SnaBI ; B: BamHI; Hd: HindHI; X: X hoI; R: EcoRI- 
Xb: Xbal; He: Hindi; s P : SphI ; P: PstI; TRP1 . 
yeast TRP1 gene; A P : ampicillin resistance gene- 
0RI(pBS/ +): P BS/ + origin of replication; ARSc: 
consensus ARS sequence (TAAACATAAAA ; Braoch, J et 
al., ^ld_S P rin^Harbor_Syap. Quant. m 47:1165" 
(1983)). TEL: sequence that seeds telomere 
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generate EcoRI overhanging ends. The 2237 bp EcoRI. 
lxnked Xmnl-Styl fragment is purified by gel 
electrophoresis. 

BamHI linkers are added on to the 1.1 kb 
HindUI fragment from YI P 30 (ATCC #37109) that 
carries the URA3 gene. This fragment is inserted 
into the BamHI site of pBS/+ (Stratagene Cloning 
Systems, LaJolla, CA)> such that orientation 
URA3 transcription is directed away f ron the EcoRI 
sxte in the polylinker. The resulting plasmid is 
cut with HindUI, blun ted with T4 DNA polymerase 

TZZT^TT** with T4 DNA lisase to rem °- ^ 

Hxndlll sxte of the polylinker. The resulting 
Plasmid is cut with Nsil and Sail, blunted with T4 
DNA polymerase, and recirculated with T4 DNA 
US... to remove the Nsil, BamHI (3' side of T,R A 3 
only . Xbal, and SalI sltes in ^ ~3 

resulting plasmid is cut with EcoRI and BamHI and 
annealed with Olieos S a „A c ^ 

The . Ig ° S 5 and 6 show n in Figure 8b. 

The ffllxture is treated ^ 

Polymerase, and again with T4 DNA ligase, and 
tra f orffied lnto bacteria Affipicillin . resi 

transformants are screened by restriction enzyme 

analysis for the presence of an Apal sit 

with the polylinker and , , introduced 

and EcoRI site * ^ "generate 

ana fccoRl site are subject to dideoxv n Bl 

to confirm the correct polylinker 

Plasmid is PURA3LH. P ° lyllnker This 
The host strain XLl-Blue rst-7- a -» 
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and a mixture of wild-type and pURA3LH phage 
particles is isolated. Cells from the dut ung E. 
coli strain CJ236 (Bio-Rad Laboratories, Rockville 
dntre, N.Y.) are infected with this mixture of 
phage, and a mixture of puRA3LH and M13 single- 
stranded DNA is isolated. Oligonucleotide 12 
(Figure 8c) is used essentially as described by 
Kunkel (Kunkel. T.A.. Proceedings of the National 
Academy of Sciences (USA). 82:488-492, 1985) to 
in troduce a base substitution at the XhoII site at 



position 906 in the published URA3 sequence (Rose M . 
Grisafi, et al. . Gene, 29:113-114). The resulting 
plasmid, pURA3LHX" , is cut with EcoRI and BamHI, and 
ligated to the 0.7 kb EcoRI - BamHI TEL fragment from 
pTKENDA (Section A.2.b.). The resulting plasmid, 
pT42H. is cut to completion with EcoRI and partially 
with PstI, blunted with T4 DNA polymerase, ligated 
to EcoRI linkers (CGGAATTCCG) , and cut with EcoRI to 
generate EcoRI overhanging ends. The 1.7 kb EcoRI - 
linked fragment is purified by gel electrophoresis 
and ligated to the EcoRI-linked fragment from pT41H 
purified above. Tetracycline resistant trans- 
formants are analyzed by restriction enzyme analysis 
for molecules with a single copy of each fragment in 
either orientation. This plasmid is digested with 
BamHI and Smal and the same 1 . 8 kb stuff er fragment 
derived from E^. coli used in the construction of 
pTKENDA is inserted. The resulting plasmid is 
designated pTKENDC. Figure 6c is a plasmid map of 
pTKENDC. with relevant features and restriction 
enzyme recognition sites. N: Nsil; A: Apal ; Sm: 
Smal; B: BamHI; Hd: HindHI; X: XhoII; R: EcoRI; 
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Ah: Ahalll; URA3 : yeast URA3 gene; Tc : tetra- 
cycline resistance gene; ORI (pACYC184) : PACYC184 
origin of replication; ARSc: consensus ARS sequence 
(TAAACATAAAA; Broach, J. et_aL , (1983) Cold_S£ring 

5 Harbor Symp. Quant. Biol. , 4 7:1165). TEL : s equenc e 

that seeds telomere formation in yeast; exS, exM, 
exN, exP, exB , exX: former Styl, XmnI , Nsil, PstI, 
BamHI, and XhoII sites, respectively; dashed line: 
stuff er DNA fragment derived from E_^ cpli . The 

10 arrow indicates the direction of URA3 transcri ption. 

A . 2 . e ) Construe t i o n_o f _a_YA C_A rm_V e ctor 

pTKENDD 

PACYC177 (ATCC #37031; Chang, A.C.Y. and Cohen, 
S.N. Journal of Bacteriology , 134 : 1141-1156 , 1978) 
*5 is cut with Sau96, blunted by treatment with T4 DNA 

polymerase, and the 1.2 kb fragment carrying the 
kanamycin resistance gene is isolated by gel 
electrophoresis. This fragment is ligated to Hindi 
cut pBS/+ (Stratagene Cloning Systems, LaJolla, CA) 
20 and chloramphenicol and kanamycin resistant clones 

are analyzed by gel electrophoresis for recombinants 
with the kanamycin gene inserted such that the 
direction of transcription is directed away from the 
EcoRI site in the pBS/+ polylinker. The blunt- 
25 ending of the Sau96 sites and subsequent ligation to 

Hindi cleaved pBS/+ results in Sail sites at the 
left and right junctions. This plasmid is pTSOH. 
To remove one of the two inverted repeats flanking 
the kanamycin resistance gene (the 5' inverted 
50 repeat relative to the direction of transcription), 

pT50H is cleaved with Sail and Drain and the 1.08 
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kb fragment containing the kanamycin resistance gene 
is purified, blunt-ended by treatment with T4 DNA 
polymerase, and ligated to Hindi digested pBS/+. 
The resulting plasmid, with transcription of the 

5 kanamycin resistance gene directed away from the 

EcoRI site in the pBS/+ polylinker. is pT50ASD. 
pTSOASD is introduced into the host strain XLl-Blue 
(Stratagene Cloning Systems, LaJolla, CA) , and 
subsequently infected with wild-type M13 (Bio-Rad 

10 Laboratories, Rockville Centre, New York) and a 

mix ture oi w_i 1 r).-_<-_iT_T>.«. - - j __n>_=./v. 

-->r- yi-iwzu pnage particles are 



25 



isolated. Cells from the dut'ung" E . coli strain 
CJ236 (Bio-Rad Laboratories, Rockville Centre, N.Y.) 
are infected with this mixture of phage, and a 
15 mixture of pTSOASD and M13 s ingl e - s trended DNA is 

isolated. Oligonucleotides 14, 15 and 16 (Figure 
8c) are used essentially as described by Runkel 
(Kunkel. T.A..) Pr ££eedin £ s_of_the_National_Academv 
5l_Sciences_IJSA, 82:488- 92, 1985) to introduce bale 
20 substitutions at two Nsil sites (positions 2203 and 

2469 of the published pACYC177 sequence) and at an 
XhoII site at position 2602 of P ACYC177. The 
resulting plasmid, pTSOHX is cut with EcoRI and 
Sphl, blunted with T4 DNA polymerase, and cir- 
cularized with T4 DNA ligase, (regenerating the 
EcoRI site). The resulting DNA preparation is then 
cut with Xbal. This fragment is ligated to the 882 
base pair AccI-XhoII fragment of pACYC177 (which has 
been blunted with T4 DNA polymerase, ligated with 
Xbal linkers (GCTCTAGAGC) , and treated with Xbal to 
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generate Xbal overhangs) carrying the plasmid origin 
of replication, to generate plasmid pT51H (either 
orientation will suffice) . 

Plasmid pT52H is constructed by cutting plasmid 
5 YIp33 (ATCC #37064) with Hpal and AccI to release a 

1.6 kb fragment containing the yeast LEU2 gene 
(Andreadis, A., et al., Cell , 31:319-325, 1982). 
This fragment is blunted with T4 DNA polymerase and 
ligated to pUC18 (ATCC #37253) cut with Hindi. The 
20 resulting plasmid is cut with BamHI and Xbal, and 

anjiea.Led_w-ixb__oXi-gon^ucleotides_7— and— 8 (-F-igure 8 b-)-. 

The mixture is treated with T4 DNA ligase, T4 DNA 
polymerase, and again with T4 DNA ligase, and 
transformed into bacteria. Arapicillin resistant 
15 transf ormants are screened by restriction enzyme 

analysis for the presence of an Apal site introduced 
with the polylinker and plasmids that regenerate a 
BamHI site are subject to dideoxy DNA sequencing to 
confirm the correct polylinker sequence. The 
20 resulting plasmid is pT52LH. pT52LH is digested 

with BamHI and PstI, and the gel purified 1.6 kb 
fragment is ligated to pT51H cut with BamHI and 
PstI. The resulting plasmid, pT53H, is digested 
with Seal and Bglll , and ligated to the double- 
25 stranded oligonucleotide shown in Figure 8c (oligo- 

nucleotides 9A and 9B) . The resulting plasmid 
(pT53HL) is partially digested with Hindlll, 
followed by complete digestion with Bglll and the 
digestion product corresponding in size to 
30 linearized pT53HL (approximately 3.7 kb) is 
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purified. This species represents cleavage at the 
adjacent HindHI and Bglll sites introduced via 
Oligonucleotides 7 and 8 (Figure 8b). Plasmid 
pTKENDA (Section A.2.b and ATCC accession number 
40833) is digested with EcoRI and treated with the 
Klenow fragment of E . coli DNA polymerase to 
generate a blunt end. This DNA is then digested 
with BamHI and the 0.7 kb TEL fragment is gel 
purified. Plasmid YCpl9 (ATCC #37364) is digested 
with HindHI, Pvull, and Pvul and the 2 . 6 kb 
Hindlll-Pvull fragment car rying CEN4 and ARS1 is ge l 
— p-ur-r-f-i-e-d^-T-he-puri-f-i-w-d-CEN-A-ARSl and TEL fragments 
are ligated to Bglll-Hindlll digested" P T53HL and 
transformed in E. coli. Kanamycin resistant trans- 
formants are screened for plasmids with a single 
copy each of the CEN4 - ARS1 , TEL , and T53HL frag- 
ments. The resulting plasmid is pT5 4H. pT 54H is 
digested with Pvull and Sad, and ligated to 

25 ^ ;/ VU11 fraSIDent lying b ™ ^s, tions 
25.881-27,414 on the bacteriophage Lambda (New 

England Biolabs, Beverly, MA) aa p . The resulting 
Plasmid is pTKENDD . Figure 6d is a plasmid map of 
pTKENDD with relevant features and restriction 
enzyme recognition sites. N : ».„. A: 
BamHI ; He: HincIII; Pv: Pvull; p. PstI , 
SalKHincIl); «d: HindlH; X: XhoII; Xb :' Xbal • 

Saci; Ah: Ahalll; LEU2: yeast LEU2 gene; Km: 
Kanamycn resistance gene: OEI ( P ACYC177) : pACYC177 

" ° f "'""""i ARSc: consensus ARS seouence 
(TAAACATAAAA; Broach, J. £ t a 1 . , (1983 ) Cold 
Sarbor.Sy^^uant^Biol, 47 : 11S5) . CEN^RsT 
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CEN4/ARS1 fragment from YCpl9 (see text); TEL: 
sequence that seeds telomere formation in yeast; 
exR, exPv, exN, exX: former EcoRI f PvuII, Nsil and 
XhoII sites, respectively; dashed line: stuff er DNA 
5 fragment derived form bacteriophage Lambda. The 

arrow indicates the direction of LEU2 transcription. 

A . 3 ) Constructio n o f Yeast Artificial 

Chromosome (YAC) Libraries 
DNA from human white blood cells is prepared 
and partially digested with restriction endo- 
nucleases, essentially as described (D.Burke, Ph.D. 
thesis, Washington Univ., St. Louis, MO (1988)). 
DNA (with a desired average size of greater than 1.5 
megabases) is partially digested with Apal , Nsil, or 
any enzyme that leaves a blunt end. To construct 
Library 1, plasmids pTKENDA2 and pTKENDB are used. 
pTKENDA2 is cleaved with BamHI and either Apal, 
Nsil, or Smal to release the stuff er fragment. 
pTKENDB is cleaved with BamHI and either Apal, Nsil, 
or SnaBI to release the stuff er fragment. For the 
construction of Library 2, plasmids pTKENDC and 
pTKENDD are used. pTKENDC is digested with BamHI 
and either Apal, Nsil, or Smal to release the 
stuffer fragment. pTKENDD is digested with SacI and 
either Apal, Nsil, or PvuII to release the stuffer 
fragment . 

Each vector is treated with calf intestine 
alkaline phosphatase under conditions recommended by 
the supplier and purified by phenol extraction and 
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ethanol precipitation. For each library, 50 „g of 
human DNA and 25 MB of each vector in each pair 
(pTKENDA2 -pTKENDB or pTKENDC - pTKENDD ) are mixed and 
ligated using T4 DNA ligase for 2 days at 12'C, in a 
ligation buffer recommended by the enzyme supplier. 
The ligated DNA is size fractionated by Field 
Inversion Gel Electrophoresis (Carle et a 1 . , 
Science, 232; pp 65-68, 1986) in low-gelling temper- 
ature agarose (FMC Corp., Rockland, Maine), or CHEF 
gel electrophoresis (Chu et a 1 . , 1986 o P cit^ 



30 



the portion of the gel containing DNA of 250-450 kb 
is excised and equilibrated with TE buffer + 45mM 
NaCl. 

A . 3 . b ) iiansfor 5 atio 2 _oLYeast_S E her ££ lasts 
^ilh_DNA_Li £ ated_to_YAC_VectoLAr2s 
S2d_Selection_of_Yeast_Cells_ca 
^I^ili£i a 1_ Chromosomes 
DNA prepared as desc^ibed^n section A. 3. a. can 
be used to transform a haploid S. cerevisiae strain 
carrying chromosomal deletions for ARG4 , TRP1 URA3 
and LEU2 to arginine and tryptophan prototrophy 
using human DNA ligated to pTKENDA2 and pTKENDB es- 
sentially as described by Burgers and Percival 
(1987), with the following modifications: 10 - 20 
Pi of the low-melt agarose carrying the DNA is 
melted at 68'C for 3 to 5 minutes. Carrier DNA 
(sheared salmon sperm or calf thymus DNA) is added 
to the cells to a final concentration of 30 - 40 
/*g/ml immediately before 200 M l of cells is added to 
the melted gel slice. 
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For plating and selection of yeast cells 
carrying artificial chromosomes, transformed cells 
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are mixed with top agar (1M sorbitol, 2% dextrose, 
0.5% ammonium sulfate, 0.17% yeast nitrogen base 
(Difco), 2.5% Bacto-agar (Difco), 0.005% adenine 
sulfate, and supplemented with uracil and all of the 
amino acids listed in Table 13.1.1 of Ausubel et a 1 . 
(An sub el et al. , Current Pro toco 1 s in Mole cul ar 
Biology, Greene Publishing Associates and Wiley- 
-I-n-te-r-s e-i-enc e ,— New— y.o.rk-,— l-9-8_7_)_a-t_th.e_li.S-te ; d_c.o.nc.e.n^_ 
trations, but omitting arginine and tryptophan for 
selection. The mixture of cells and. top agar is 
poured onto the surface of agar plates made 
identically to the top agar except that the final 
concentration of agar is 2% in the plates. Plates 
are incubated at 30° C for 5-7 days. 

To construct Library 2, human DNA ligated to 
pTKENDC and pTKENDD are used to transform the same 
S . cerevisiae strain to uracil and leucine pro to - 
tropy. Top agar and plates are prepared as des- 
cribed above, but lacking only uracil and leucine. 
A . 3 . c ) Pooling of Clones 




Yeast colonies growing on plates selective for 
markers present on artificial chromosomes are 
transferred using sterile toothpicks into individual 
wells of 96-well microtiter plates filled with 200 
/*1 of selective media. Plates are incubated with 
shaking at room temperature for 2 days and stored at 
4*C for up to one week. A fully representative YAC 
library of the human genome should be comprised of 
50,000 independent clones, assuming an average clone 
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size of 300 kb. This number of clones would fill 
521 micro ti ter plates and is stored as 10 separate 
subpools. When approximately 52 plates are filled, 
100 /il from each well is withdrawn, pooled, and 

5 thoroughly mixed with an equal volume (approximately 

500 ml) of 30% sterile glycerol. The cell density 
of the cells in glycerol should be about 2.5 x 10^ 
cell/ml, and can be adjusted to this density by 
counting cells prior to glycerol addition. The 

10 pooled cells are then aliquoted into microcentrifuge 

tubes in volumes of 0.1 to 1 ml, set on dry ice to 
quick freeze, and stored at -70°C. This is repeated 
for each of the 10 separate subpools. 

B . ) Iran s forma t i on_o f _P £ o 1 e d_L ib r ary_l_w i th_a_Tar ge t i n£ 
15 Il££Sid_and_Selection_of_S£ecif i £_Ar t i f ic i al 

C^r OTOo^ome_C l^o^ne 

The isolation of DNA YACs by homologous 
recombination is illustrated in Steps 1 and 2 of 
Figure 4. 

20 B . 1 . ) ££££* J.H£i i£n_£f_th e_T ar ge t ing_P 1 asm id 

The desired fragments of human DNA (the 
targeting sequences), previously identified as being 
unique or at low copy number in the human genome are 
substituted for the TEL and stuffer domains of 

25 pTKENDC. 50 pig of the resulting subclones are 

prepared and digested to completion with a 
restriction endonuclease which generates a linear 
molecule harboring a double - strand break or gap in 
the targeting sequence, in such a manner that at 

30 least 150 base pairs, but possibly less, of 
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targeting DNA remains on either side of the break or 
gap, and the pTKENDC vector backbone is intact and 
contiguous with the targeting DNA. The digested DNA 
is purified by phenol extraction and ethanol pre- 
cipitation and resuspended in 20 /il. 
B.2.) Transformation of YAC Library 1 with 

the_Tar£eting_Plasmid_a 

of Clones Homogolous^to_the^Tar£Pf--i_n£ 
Segue nee 

0>1 mi of each of the 10 subpools are combined 
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in 100 ml CM -arg, trp selective media supplemented 
to 0.05X YPD. Cells are grown overnight with 
vigorous shaking at 30C to a density of 2 x 10 7 
cells/ml. Cells are prepared for transformation by 
the lithium acetate method (Ito et al., 1983) 
essentially as described (Ausubel et al. , Current 

LL9.^2£9.1^kri^llol^£^lsLL^liSil£.Ey:^ Greene Publishing 
Associates and Wiley- Interscience , New York, 1989), 
and split into six 200 pi aliquots at 2 x 10 9 
cells/ml. 50 pg of each of the linearized targeting 
plasmids (in 20 pi) is mixed with 10 pg (2 pi) 
sonicated calf thymus DNA and added to a 200 pi 
aliquot of cells. After transformation, cells are 
spread onto the surface of CM-arg, trp, and uracil 
agar plates and incubated at 30°C for 3-5 days. The 
omission of uracil from the media selects for cells 
that have stably integrated the targeting plasmid 
derived from pTKENDC. 
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£2£lZ£is_of_Clones 
C . 1 ) Se £ re £ ation_Anal Y sis_of_Clones 

Yeast colonies prototrophic for arginine , 
tryptophan, and uracil are candidates for clones 
5 carrying the targeting plasmid integrated into a 
human DNA YAC with a region of identity to the 
targeting sequences on the targeting plasmid 
Colonies in which the targeting plasmid integrated 
into a YAC are identified by a marker segregation 
10 assay. The loss patterns of the three markers are 
^-^-^-d,r W rom tfii selected clone 



15 



which have lost the YAC after growth on non- 
selective media. Cells are patched onto YPD plates 
and grown non- selectively for two days, replica 
Plated onto a second YPD plate and grown for another 
two days. Cells from the second YPD plate are 
struck-out for single colonies on a third YPD plate 
After three days, the plate with single colonies is ' 
replxca printed onto a CM -arginine, tryptophan 
Plate, and a CM -uracil plate. Clones in which the 
targeting plasmid is integrated into a YAC are 
identified hy their characteristic pattern of 
co-loss of all three markers. i n these cases 
colonies that are auxotrophic for arginine and 
tryptophan (colonies that lost the markers identi- 
fyxng the YAC) are also auxotrophic for uracil. 

of_Cl one s 
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generate the double-strand break in the targeting 
sequence. 1 Mg e f the digested DNA is subject to agarose 
gel electrophoresis and Southern transfer and probed with 
32-P labeled DNA corresponding to the fragment of the 
5 URA3 gene carried in pTKENDC . As a control, 1 ng of the 
digested plasmid generated in B.l) above is run alongside 
the yeast DNA samples. A correctly targeted event is 
characterized by a band on the autoradiograph that 
migrates exactly the same distance as the pure, 
10 linearized targeting plasmid. 

C . 3 ) lH£He_£LClon^lerMni_to_Gener H e_Labeled 

S e^u e n c e s _ t hat_ a r e_ S i n £ 1 e_ C o p y in the'cenn^ 
^H-^iMSa.^te^ination_of_the 
15 2£l£S t a t i on_p f _ci one d_Ins e r ts "rIi a t i y e to 

Vector_Ar I n 1 ^_Generation_of a Targetin^^^ 

f^_Clone_Termini^and^ransforma 
Pooled_Library_2 

The YAC cloning vectors pTKENDA2 , pTKENDB , pTKENDC 

2(J and pTKENDD have been designed specifically to facilitate 

the rescue of cloned DNA from the ends of DNA YACs by 

sample microbiological techniques. One or more 

recognition sites for restriction enzymes that cut 

mammalian DNA relatively frequently (approximately once 

25 -ery 0.5-1.5 kb) are positioned at the junction between 

the bacterial plasmid replicon and the yeast telomere 

rrlll ° r yea " repliCati ° n ori 6- CARS) and centromere 
(CEN) sequences. For any one of the four ends 
recognition sites for a subset of such enzymes' are not 
30 found at any other position in the plasmid replicon 
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the yeast selectable marker on that end, such that 
cleavage of total yeast DNA isolated from cells carrying 
a particular DNA YAC with one of these enzymes rescues 
(as illustrated in step 3 of Figure A) DNA from the 

5 cloned insert covalently linked to the yeast selectable 
marker and bacterial r ep 1 icon ,* but free of yeast chromo- 
some replication and stability elements (telomeres, 
centromeres, and yeast replication origins). This 
"rescued" DNA is used as the targeting plasmid for the 

10 second DNA YAC library. Column 2 of the Table (RESCUE 

SITES) lists the restriction enzymes useful for rescuing 
cl oned DNA adjacent to each of the four ends in the two 
DNA YAC libraries. Column 3 (ADDITIONAL ENZYMES) lists 
some of the additional enzymes that can be used in con- 

15 junction with the enzymes listed under RESCUE SITES in 
the event that a RESCUE SITE enzyme rescues a very long 
sequence containing a repetitive DNA element that might 
prevent the clone from being useful for selecting DNA 
YACs by homologous recombination. 
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TABLE 



YAC END 


RESCUE 


»J J- X £« O 


ADDITIONAL ENZYMES 


PTKENDA2 


Hindi 


(1433) 


PstI 


(3169) 




Hindlll 


(1844) 


Xhol 


(21462) 




SphI 


(4522) 


EcoRI 


(2669) 








BamHI 


(5604) 








Kpnl 


(8902) 








StuI 


(3872) 








Avail 


(790) 








Hpal 


(4240) 


pTKENDB 


Hindi 


(1433) 


Xhol 


(21462) 




EcoRI 


(2669) 


Tthllll 


(1070) 








Styl 


(785) 








BamH 


(5604) 








Kpnl 


(8902) 








StuI 


(3872) 



10 



15 



pTKENDC 



20 



Ahalll 

BstYI 

EcoRI 



pTKENDD 



25 



Ahalll 

BstYI 

BamHI 



Hpal (4240) 

(1192) Tthllll (1070) 

(930) Xhol (21462) 

(2669) BamHI (5604) 

Kpnl (8902) 
Hpal (4240) 

(1192) HgiAI (1348) 

(930) Hpal (4240) 

(5604) SphI (4522) 



The numbers in parentheses represent the average number of 
base pairs between restriction sites calculated for 
mammalian DNA . 
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The recovery, analysis and use of clone termini for 
recombination walking is illustrated in Steps 3-6 of Fig. 
4, U: yeast URA3 gene; X: restriction enzyme cleavage 
site used to make targeting break; striped box: 
5 targeting sequence; thick lines: plasmid sequences for 
propagation and selection in E. coli; Ap : ampicillin 
resistance gene; Cm: chloramphenicol resistance gene; T: 
yeast TRP1 gene; A: yeast ARG4 gene; solid circles and 
horizontal arrowheads: yeast centromere/replication 

1 0 ori g ins and telomer es. r.e.s.t>.e.c.ti _v<0 ... . 

human DNA in Library 1; Y : restriction enzyme cleavage 
sites used for end-rescue; L : yeast LEU2 gene; Km: 
Kanamycin resistance gene; Tc : tetracycline resistance 
gene; Z: restriction enzyme cleavage site used to make 
15 targeting break in end-rescued DNA; thick shaded line: 

cloned human DNA in Library 2. The thin line in Library 
2 DNA represents a sequence homologous to end-rescued DNA 
from Library 1. 

The remainder of the discussion will relate to 
20 isolating (rescuing) the left-hand end of the YAC, but 
the principles can be extrapolated for homologous 
recombination walking using any of the four ends in the 
two DNA Libraries. The vertical arrows marked "Y» can 
represent the positions of Hindi sites lying at various 
25 positions throughout the human DNA (for mammalian 

genomes, Hindi sites have an expected distribution of 1 
site/1.4 kilobases). The vertical arrow on the extreme 
left side indicates the position of a Hindi site that 
separates the TEL element from the TRPl-pBSA element. 
30 Cleavage of total DNA from the yeast strain carrying the 
YAC illustrated will release the TRPl-pBSA fragment from 
the TEL sequence on the left side, but the right side 
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will regain attached to a fragment o£ cloned DNA 
extending to the first Hindi site within the xnsert. 
The total DNA is ligated under conditions which promote 
"rcularization of fragments. A fraction of this » 
US ed to transfer, bacterial cells to isolate ampicillxn 

resistant plasmids. 

^proximately 60 „g of plasmid DNA is purified, and 
several micrograms are digested with Hindi 
enzyme used to digest the genomic DNA constituting the 

. „__t_ v Mc -iT->. If Library 1 was con- 

library ^anao*. > — - 



10 



20 



25 2 



30 



structed by cleaving genomic DNA with Smal and ligated to 
the SnaBI digested pTKENDB , then an enzyme other than 
Sma I or SnaBI which flanks the cloning site must be used 
(for example, Apal or Nsil) . The digest is fractionated 
on an agarose gel and the non-YAC vector fragment the 
rescued insert) is purified and a fraction is labeled 
with "-phosphorus or chromogenic nucleoside triphos- 
phates. This DNA is used in three different ways: 
! The DNA is cut with a selection of restriction 

enzymes that are known not to cut within the TRP.l 
pBSA sequence- (ADDITIONAL ENZYMES in the Table 
among others can be used). The digestion products 
are analyzed by gel electrophoresis to identify 
restriction enzymes which will cut the cloned DNA 
isolated from the end of the YAC. 

The labeled DNA is used to probe a Southern blot 
filter of human and yeast DNA to determine if the 
end of the YAC corresponds to a single copy sequence 
in the human genome, or if it is homologous to the 
yeast genome. Human sequences that are single copy 
or low copy and not homologous to yeast DNA are 
preferred for targeting. 
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3. The labeled DNA is used to probe a dot-blot, In 

which total DNA from yeast cells carrying YACs have 
been isolated and fixed to a Nylon membrane. The 
membrane is spotted with DNA from the YAC that the 
labeled DNA is derived from (YAC-2) , the YAC over- 
lapping with YAC-Z which is used to isolate YAC-Z in 
the previous recombination selection step (YAC-Y) . 
and the YAC overlapping with YAC-Y which was used to 
isolate YAC-Y in the previous recombination 
selection step (YAC-X) [i.e., the last three YACs 

isolated in the walk]^ Hy.b.r.i.di.z.a.t.i.on-onl-y-t-o-t-he 

YAC from which is derived (YAC-Z in this case) 
indicates that the TRPl-pBSA end of YAC-Z extends in 
the correct direction, away from the YACs Y and X. 
This is confirmed by a similar analysis with the 
other end of YAC-Z, which must hybridize with YAC-Z 
and YAC-Y and/or YAC-X. 

A targeting plasmid meeting the criteria outlined in 
2) and 3) above is cleaved with an appropriate restric- 
tion enzyme (identified from 1 above) and as denoted as Z 
in Figure 4), and used as the targeting plasmid to 
isolate clones from Library 2, as described in Section 
B.2 above. 

EXAMPLE_IV ^^Zis^or^^^en^^the^currsnce 
^-^21^^ve_lnters2,ersed_DNA_at_D^A 

The vectors described in Example III incorporate 
novel features that are specifically designed to 
facilitate chromosome walking. First, the two ends of 
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the artificial chromosome are derived from two different 
plasmids, each with its own sequence to seed telomere 
formation in yeast, a bacterial origin of replication, a 
gene for resistance to an antibiotic for selection in E . 
coli , and a selectable gene for clone selection in yeast. 
5 This system allows either end of the YAC to be isolated 
as a bacterial plasmid for amplification and use in each 
walking step, as opposed to the possibility of isolating 
only one end with existing YAC vectors. 

In the preferred embodiment of any walking strategy, 
10 extreme end of a clone is used as a probe to isolate 

overlapping clones in the walk. The usefulness of such a 
probe is limited by the presence of repetitive DNA which 
may be homologous to thousands of clones within the 
library. Members of the class of DNA sequences termed 
25 highly repetitive interspersed are found at thousands of 
discreet locations throughout the human genome. 
Specifically, a member of the Alu family of repetitive 
DNA sequences is found, on average, spaced at 1 to 3 
kilobase intervals throughout the genome (Moyzis, R.K., 
20 et a 1 - , Genomics , 4:273-288, 1989). 

The methods and vectors described in Example III 
have been designed to minimize the occurrence of 
repetitive DNA at the terminus of the DNA clone inserts 
in a human DNA YAC vector library. The first feature 
25 incorporated into the vector library design is the use of 
a specific set of restriction endonucleases to cleave 
human DNA. Numerous DNA sequences from the Alu and LI 
family of repetitive DNA were analyzed using computer 
programs that identify recognition sites for restriction 
30 endonucleases. The results of this analysis revealed 
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that recognition sites for the restriction enzymes Apal , 
NsiI T and Seal are not found in the published consensus 
sequences for any of the Alu subfamilies, and are found 
only rarely in sequenced members of the LI family (of 
approximately 30,000 base pairs of sequences LI DNA 
analyzed, there were only five sites for the three 
enzymes listed above; 23 sites would be expected based on 
the dinucleotide frequencies found for human DNA) . These 
two families alone account for approximately 10% of the 
mass of the huma n genome. indicating that— as— asa-n-y— as— one 



10 in ten clone ends (1 in 5 clones) may terminate within 

one of these repetitive sequences. By using the enzymes 
disclosed above to cleave human DNA, one creates an 
inherent bias against the occurrence of these two 
repetitive sequences at the ends of clones. 
15 The second feature incorporated into the design of 

the YAC cloning vectors to minimize the occurrence of 
repetitive DNA in targeting probes used for walking is 
limiting the size of the DNA probe fragment rescued from 
the DNA clone end. Smaller DNA fragments have a lower 
20 probability of containing repetitive DNA. The vectors 
described in Example III have been designed to rescue 
fragments of human DNA on the order of 1-2 kb in length 
by a single restriction enzyme cleavage of the YAC clone. 
This is accomplished by the insertion of a polylinker 
carrying recognition sites for multiple restriction 
enzymes which cut, on the average, once every 0.5 - 1.5 
kb. When total DNA from yeast carrying the YAC is cut 
with one of these enzymes, a fragment of DNA containing a 
plasmid origin of replication and a drug resistance 
marker (for propagation and selection in E. coli) , as 



25 



30 



WO 93/03183 



PCT/US91/08679 



-80- 



wcll as a gene for selection In yeast, and approximately 
1-2 kb of human DNA will be released. This fragment can 
be circularized and transformed into bacteria. As 
expected, the recognition sites for enzymes that are most 

5 useful for this step are found within several of the 
elements used in the construction of the proposed YAC 
cloning vectors. In vitro mutagenesis to delete restric- 
tion enzyme cleavage sites, along with the judicious 
choice of combinations for the two plasmid replication 

X0 origins - ; the f our - dTug^re'si'stance - markers - ; and - the f our 
yeast selectable markers is used to create vectors 
lacking the frequent- cutting res trie tion.- enzyme cleavage 
sites listed in the Table (Rescue Sites) . 

15 Yeast Ar tifi cial Chromosome Clones_for the Isolation 

of Clones_Known_ to be Present_in_a_Yeas t Ar tfjfic ial 

Feasibility of Llbrary_ScreeninR by Homologous 
Re combination 

We used homologous recombination screening to 
extract a clone from the library that was known to exist 
within the library. Since the vector arm containing the 
TRP1 gene in YACs constructed with pYAC4 contains a 
plasmid replicon and a selectable marker (the beta- 
lactamase gene conferring ampicillin resistance) , the 
technique of "plasmid rescue" was used to isolate 
terminal fragments from two YACs constructed in the 
vector pYAC4. The restriction enzyme Xhol cleaves at a 
single site within the TRP1 vector arm, at the junction 
between the telomere and pBR322 sequences. Complete 
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digestion of YAC DNA with Xhol should produce a restric- 
tion fragment devoid of telomeric sequences, containing * 
functional plasmid replicon and Amp r marker, and har- 
boring a segment of human DNA that was adjacent to the 
vector arm in the original YAC clone and extends to the 
5 terminal Xhol site in the human DNA insert. 

A group of 161 YACs within the library were con- 
structed using the host yeast strain MGD131-10c (genotype 
a leu2-3,112 ADE2 cyh2 r hisAl trpl-289 agr4A ura3-52). 
Total DNA from two clones in this group was 
10 Xhol, ligated under dilute conditions to promote intra- 
molecular circularization, and transformed into E. coli 
(all steps carried out essentially as described in 
Ausubel et a 1 . , 1988 [above]. Plasmid DNA was isolated 
from ampicillin resistant colonies and subjected to 
15 restriction enzyme analysis. One human DNA fragment from 
each of the two rescued plasmids was subsequently blunt- 
ended by treatment with T4 polymerase and ligated into 
the Smal site of p 1 84DLARG . The fragments, 10B and 8A , 
are 1 and 4 kb fragments, respectively, of human DNA 
lying adjacent to the TRP1 vector arms in two different 
YACs. The resulting constructs (plasmids pl84-10B and 
P184-8A) were digested with a number of restriction 
enzymes which do not cleave pl84DLARG to identify an 
enzyme that would cut within the human DNA to promote 
targeting. 20 „g of each construct was digested with the 
appropriate targeting enzyme and used for library 
screening, essentially as described in Example 1. 
Fragment 8A contains a single Kpnl site lying 2 -8 Kb from 
one end and this enzyme was used to introduce a unique 
double strand break within the inserted sequence in 
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P184-8A. Fragment 10B contains a single Avail site lying 
0.5 Kb from one end and this enzyme was used to introduce 
a unique double strand break within the inserted sequence 
in pl84-10B. 

Eleven arg + colonies resulting from screening with 
5 clone 8A were isolated and analyzed. Similar to strain 
lV-16d (Example 1 and ATCC Accession No. 74010) strain 
MGD131-10C carries a 2 kb deletion encompassing the 
entire ARG4 gene. However, the two strains differ with 
regard to their LEU 2 genotype; IV-16d is le u + anH HGULZL- 
10 10c has a leu" phenotype. Seven of the eleven colonies 
displayed a leu" phenotype, suggesting that they indeed 
represented independent isolates of the original YAC from 
which clone 8A was derived (a very strong possibility 
since strain MGD131-10c is the host for only 161 out of 
the 11,625 YACs (1.4%) in the library). Seventeen arg + 
colonies resulting from screening with clone 10B were 
isolated and analyzed. Three of the 17 colonies dis- 
played the leu' phenotype. The presence of the leu- 
marker strongly suggests that these clones represent 
isolates of the original YAC from which clone 10B was 
derived. 

DKA was prepared from each of the seven leu- 
colonies isolated by screening with clone 8A as well as 
one of the leu + colonies. DNA was digested with the same 
enzyme used to linearize the transforming DNA molecule 
(Kpnl). A Southern blot of these digests were probed 
with 32-P labeled ARG4 DNA. As described in Example 1 
homologous integration events should reveal hybridization 
to a single fragment of exactly the same size as the 
linearized transforming DNA molecule (referred to in 
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Example 1 as a Unit Length Linear Fragment, or ULL) . 

the eight clones analyzed, all seven in strain MGD131-10c 

(the leu* colonies) represent homologous events, while 

the single leu + transformant analyzed (lane 8) does not 

(Figure 9). Thus, seven out of eleven candidate clones 

isolated were correctly targeted events. A similar 

analysis was performed on each of the three leu* colonies 

isolated by screening with clone 10B. All three clones 

displayed a ULL upon Southern blot analysis, while 14 

leu transf ormants did not. 

To co nfirm that t-K« t- *._»=. «..»_v. 

' "vi«"iugous events isolated 
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e> " ~ •-' *-v*5nuo x u 

by screening with clone 10B and the- seven homologous 
events isolated by screening with clone 8A represent the 
independent isolates of the same YACs, we have mapped the 
termini of the YACs in these ten clones. Figure 10 shows 
the result of this analysis. Three bands are evident in 
each lane, corresponding to the ULL, the left arm, and 
the right -arm of the YAC . The bands migrate at identical 
positions in all seven YACs isolated with 8A , and at 
different, but identical positions in all three YACs 
20 isolated with 10B. These data show that the distance to 
the nearest Kpnl site at each end of the seven 8A YACs is 
identical, while the three 10B YACs display similar 
behavior for the positions of their terminal Avail sites. 

EXAMPLE_VI Screenin £ _of_a_Human Yeast Artif^ 

Chromosome_Libra^v_bv_Homolo £ ous 
Recombination_to_Isolate_a_Yeast 

£^-^2£n_AdenosiiieJeaminase_I.ocus~ 
Synthetic oligonucleotides 06 and~o7-2* were'used in 
the polymerase chain reaction to amplify a 1,376 base 
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pair fragment of the human ADA gene corresponding to 
positions 34,243-35,618 (Genbank Entry HUMADAG) from 
total human genomic DNA isolated from peripheral blood 
leukocytes. The amplified fragment was digested with 
Pstl and the 852 base pair subfragment corresponding to 
5 HUMADAG positions 34,349-35,201 was isolated and cloned 

into the Pstl site of plasmid p 184DLARG (Example 1) . One 
insert orientation was chosen (that with HUMADAG 34,349 
position adjacent to the 3' end of the yeast ARG4 gene in 

tO fl /, nt A V n TV. a r o c?_vi_1_"»^j?_"r» rr -rO_o «r *.t r- A « ~ A O f\ 

10 micrograms was linearized at the unique EcoNI site within 
the human ADA insert (corresponding to HUMADAG position 
34,657) prior to transformation into the pooled YAC 
library. Transformation of the pooled YAC library was 
performed exactly as described in Example 1, with the 

25 exception being that the YAC library consisted of an 
additi onal 3,585 clones, for a total of 15,210 clones 
representing approximately 1.2 genome equivalents. 

Four arg + transf ormants were isolated. Three of 
these are displayed in figure 11 and all three displayed 

2o a unit-length linear fragment upon restriction enzyme 
digestion with EcoNI and Southern blot analysis. 
Analysis of the fourth arg + transformant confirmed that 
it carries the same insert as YAC 184ADA.C and 184ADA.D. 
All four transf ormants harbor a similarly sized YAC of 

25 ca. 200 kb , as judged by CHEF gel electrophoresis. The 
intensity of the ULL band in DNA prepared from YAC 
184ADA.B and other data indicate that YAC 184ADA.B has 
undergone multiple tandem integrations of the targeting 
plasmid. 
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Comparison of a representative YAC, YAC 1 84ADA . C , 
with hu-an genomic DNA by restriction enzyme and Southern 
hybridization analysis using multiple probes and 
restriction digests confirmed that this YAC indeed 
contains sequences from the human ADA locus. 

OLIGONUCLEOTIDE ©6 

10 20 
5' AGATCTGTTT GAGGCTGCTG TGAG 



Bases numbered 1-24 corresponding to positions 34,243- 
10 34,266 in GENBANK Entry HUMADAG . 

OLIGONUCLEOTIDE o7-2 

10 20 
5 ' AGATCCGGCA ACTTGTAGTA CCCAGGATG 

15 It'll / Un,bered 7 ' 29 ""..ponding to positions 35,618- 

35,596 in GENBANK Entry HUMADAG . Bases 1-6 corresponding 
to one of the four possible recognition sequences for the 
restriction enzyme BstYI, added to facilitate cloning 



EXAMPLE_VII 2uantif jL cation_of_Effect_of 

Chro 1 osomal_Deletions_of_Homolo 
Se£uences_Present in Host_Cell 
Orr-Weaver et_a L (Prcc, ^Tlcal Sci. USA Vol 
10:6354-6358, October 1981) showed th.TTpI^ld 
carrying the yeast LEU2 gene results in leu + transfer- 
»ants at a frequency of 1.4-1.7 per Mg of DNA when a 
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double-strand break was made in the pBR322 portion of the 
plasmid. This is 1/10 of the frequency at which leu + 
transformants arose when targeting was directed to the 
LEU2 gene by a double-strand break in LEU2 sequences 
(12-17 per pg DNA) . Similarly, when a HIS3 containing 
plasmid was cut within pBR322 sequences. his + trans- 
formants appeared at 1/60 of the rate observed when the 
same plasmid was cut within HIS3. In both cases, the 
non- targeted prototrophs were demonstrated to be the 
results of recombination between the plasmid and t he 
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chromosomal leu2" and his3* mutant genes. Thus, 
screening a library for one clone put of 50,000 by 
homologous recombination without deletion of the chr„ 
somal LEU2 gene would be expected to yield 5,000 leu + 
15 transformants which arise through homologous recombi- 
nation with the yeast genome when the targeting plasmid 
carries LEU 2 , even if a double- strand targeting break is 
made in another part of the plasmid. The results 
suggest, however, that deleting the chromosomal copies of 
LEU2 and HIS3 would eliminate virtually all of the non- 
targeted events . 

The advantage of chromosomal deletions from host 
cells for the purposes of the method was quantified as 
follows: A plasmid carrying the yeast ARG4 ("target") 
and URA3 ("marker") genes was transformed into a mixture 
of yeast cells after making a double-strand break at the 
unique Bell site in the ARG4 sequence. All of the cells 
xn the mixture had homology to URA3 , but only 1 in X 0 00 
or 1 in 10,000 had homology to ARG4 . This type of 
dilution experiment measures the relative frequencies of 
targeted and non-targeted events. For example, using 1 
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« •* DNA and . x to lfOO0 d 
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targeting plasmid and the chromosomal ura3 locus, then 
the non- targeted events resulting from homology at the 
ura3" locus are removed from the analysis and the ratio 
increases to 30,729 to 1. At this ratio, a sequence 

5 represented 3 times in 50,000 YACs would be correctly 
targeted 1.8 times for every one non- targeted event. 
This ratio would also result in the favorable ratio of 
one correct event for every 1.6 incorrect events when 
screening a library for a sequence present on only 1 in 

10 — 5 0tOOG-YACs- 

These results indicate that the selection of a 
targeted clone from a DNA YAC library is feasible and 
particularly efficient in host yeast cells that carry no 
homology with selectable markers present on targeting 

15 vectors. 
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CLAIMS 
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bacteria, and 3) t "lection i n 

— ble aarkers ^ Ho llfi; and 

same as the se library are not 

"cond li brary; leCtabl e markers in the 

introducing in t 0 ene 

««.«i. 8 DNA vector ^'^Z * 
*« «b. eukarvotic host cells "° B - r " ll ««*»« 
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*- ^e eukaryotic host cell! /" S ^ C ^ 
is homologous i„ Part / feting DNA 
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maintaining the product of step (b) under 



20 



25 



conditions appropriate for homologous recombi- 
nation to occur, whereby the eukaryotic host 
cells containing the target DNA fragment are 
stably transformed with the marker gene for 
selection in eukaryotic host cells as a result 
of homologous recombination between a target 
DNA fragment and the targeting DNA sequence and 
stably transformed eukaryotic host cells, with 
a selectable phenotype and containing the 
targe t - DNA— fragment" are produced; 

d) selecting stably transformed eukaryotic host 
cells by culturing the product of the previous 
step under conditions appropriate for selection 
of stably transformed eukaryotic host cells; 

e) digesting total DNA from stably transformed 
eukaryotic host cells with a restriction 
enzyme, thereby releasing from episomes an 
episome region which includes a target DNA 
fragment terminus, a marker gene for selection 
in bacteria; a marker gene for selection in 
eukaryotic host cells and sequences necessary 
for propagation in bacteria, thereby isolating 
an episome region; 

f) circularizing the episome region produced in 
the previous step, thereby producing a cir- 



cularized DNA molecule which is referred to as 
a subsequent targeting vector and comprises the 
episome region produced in step (e) , wherein 
the target DNA fragment derived from the 
episome region is referred to as subsequent 
targeting DNA; 
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g) selecting and amplifying the subsequent tar- 
geting vector in bacteria; 

h) introducing into the other DNA fragment library 
the subsequent targeting vector, thereby 
producing a mixed population of host eukaryotic 
cells ; 

i) maintaining the product of step (h) under 
conditions appropriate for homologous recombi- 
nation to occur, whereby eukaryotic host cells 

containin g_ th,e_t.a^g e.t_DN A-f-r-a-g-m e n t- a r «-wrtly 



j) 



25 

k) 
1) 



transformed with the marker gene for selection 
in eukaryotic host cells and the targeting DNA 
present in the subsequent targeting vector as a 
result of homologous recombination between a 
target DNA fragment and the targeting DNA 
present in the subsequent targeting vector, 
thereby producing stably transformed eukaryotic 
host cells with a selectable phenotype which 
contain the target DNA fragment that is homo- 
logous to the targeting DNA fragment present 
in the subsequent targeting vector; 
selecting stably transformed eukaryotic host 
cells by culturing the product of the previous 
step under conditions appropriate for selection 
of stably transformed eukaryotic host cells; 
repeating steps (e) through ( j ) as needed; 
constructing a physical map by ordering target 
DNA fragments derived from the episome region 
obtained in step (k) . 



1 

r 








i 


PCT/US91/08679 

WO 93/03183 








-92 - 






2. 


A method of Claim 1 wherein the targeting DNA 
molecule is a bacterial plasmid. 






3 . 


A method of Claim 2 wherein the bacterial plasmid 
has a double-strand break introduced within the 




5 


4. 


targeting DNA sequence. 

A method of Claim 1 wherein the target DNA fragment 
is cDNA. 






5. 


A method of Claim 2 wherein the episomes present in 
eukaryotic host cells are artificial chromosomes and 




10 


6. 


the artificial chromosomes additionally comprise all 
of the DNA sequences necessary for the artificial 
chromosome to participate in host cell replication 
and chromosome segregation. 

A method of Claim 3 wherein the episomes present in 


* 


15 




eukaryotic host cells are artificial chromosomes and 
the artificial chromosomes additionally comprise all 
of the DNA sequences necessary for the artificial 
chromosome to participate in host cell replication 
and chromosome segregation. 




20 


7. 


A method of Claim 1 wherein the target DNA fragment 
is selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 



sequences; mammalian genes; human genes; and plant 
genes . 



— — PCT/US91/08679 

WO 93/03183 



■93- 



8. A method of Claim 3 wherein the target DNA fragment 
is selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 
sequences; mammalian genes; human genes; and plant 

5 genes . 

9. A method of Claim 5 wherein the selectable marker 
gene is selected from the group consisting of genes 
which confer a selectable phenotype on eukaryotic 
host cells and the selectable phenotype is selected 

~i0 f rom the group c~o n~s~i~s~t~i~ng — c~£~; ant" i~b~i~e~tri - c resrst^ 

ance, nutrient prototrophy, tolerance to a metal 
ion, ability to progress through the cell cycle and 
expression of a cell surface marker. 

10. A method of Claim 6 wherein the selectable marker 
15 gene is selected from the group consisting of genes 

which confer a selectable phenotype on eukaryotic 
host cells and the selectable phenotype is selected 
from the group consisting of: antibiotic resist- 
ance, nutrient prototrophy, tolerance to a metal 
20 ion, ability to progress through the cell cycle and 

expression of a cell surface marker. 

11. A DNA sequence, isolated by the method of Claim 7, 
selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 

25 sequences; mammalian genes; human genes; and plant 

genes . 
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12. A DNA sequence, isolated by the method of Claim 8 f 
selected from the group consisting of mammalian DNA 
sequences; human DNA sequences; plant DNA sequences; 
mammalian genes; human genes; and plant genes. 

13. A method of producing a physical map of contiguous 
DNA sequences, comprising the steps of: 

a) providing a first DNA fragment library and a 

second DNA fragment library, wherein the first 
library and the second library are each in a 
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10 population of yeast cells in wh~i"ch genetic 

recombination between DNA introduced into the 
yeast host cells occurs by homologous recombi- 
nation; DNA fragments are present in the yeast 
cells in an episome which is replicatable in 
15 the yeast cells and additionally comprises: 1) 

sequences necessary for propagation in 
bacteria, 2) two different marker genes for 
selection in bacteria, and 3) two different 
marker genes for selection in the yeast cells; 
and the selectable markers in the first library 
are not the same as the selectable markers in 
the second library; 
b) introducing into one DNA fragment library a 

targeting DNA vector which is non- rep licating 
in the yeast cells, the targeting vector 
comprising a marker gene for selection in the 
yeast cells and targeting DNA which is homo- 
logous in part to a target DNA fragment derived 
from the episome region, thereby producing a 
20 mixed population of host cells; 
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maintaining the product of step (b) under 
conditions appropriate for homologous recombi- 
nation to occur, whereby yeast cells con- 
taining the target DNA fragment are stably 
transformed with the marker gene for selection 
in yeast cells and targeting DNA present in the 
targeting vector as a result of homologous 
recombination between a target DNA fragment and 
the targeting DNA sequence and stably trans- 
formed yeast cells, with a selectable phenotype 
and eoTrtra"i"ii"i"n"g the target DNA fragment are 
produced; 

selecting stably transformed yeast cells by 
culturing the product of the previous step 
under conditions appropriate for selection of 
stably transformed yeast cells; 
digesting total DNA from stably transformed 
yeast cells with a restriction enzyme, thereby 
releasing from episomes an episome region which 
includes a target DNA fragment terminus, a 
marker gene for selection in bacteria; a marker 
gene for selection in yeast cells and sequences 
necessary for propagation in bacteria, thereby 
isolating an episome region; 

circularizing the episome region produced in 
the previous step, thereby producing a 
circularized DNA molecule which is referred to 
as a subsequent targeting vector and comprises 
the episome region produced in step (e), 
wherein the target DNA fragment derived from 
the episome region is referred to as subsequent 
targeting DNA; 
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g) selecting and amplifying the subsequent tar- 
geting vector in bacteria; 

h) introducing into the other DNA fragment library 
the subsequent targeting vector, thereby 

5 producing a mixed population of yeast cells; 

i) maintaining the product of step (h) under 
conditions appropriate for homologous recombi- 
nation to occur, whereby yeast cells containing 
the target DNA fragment are stably transformed 

10 with the marker gene for selection in yeast 
cell s— and— the — targeting— DNA— present — i-n — the 



subsequent targeting vector as a result of 
homologous recombination between a target DNA 
fragment and the targeting DNA that is homo- 

i5 logous to the targeting DNA fragment present in 

the subsequent targeting vector, thereby 
producing stably transformed yeast cells with a 
selectable phentoype and which contain the 
target DNA fragment present in the subsequent 

20 targeting vector; 

j) selecting stably transformed yeast cells by 
culturing the product of the previous step 
under conditions appropriate for selection of 
stably transformed yeast cells; 

25 k) repeating steps (e) through ( j ) as needed to 

isolate target DNA fragments; and 
1) constructing a physical map by ordering target 
DNA fragments isolated in step (k) . 
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14. A method of Claim 13 wherein the targeting DNA 
vector is a bacterial plasmid. 

15. A method of Claim 14 wherein the bacterial plasmid 
has a double -strand break introduced within the 
targeting DNA sequence. 

16. A method of Claim 14 wherein the episomes present in 
eukaryotic host cells are artificial chromosomes and 
the artificial chromosomes additionally comprise all 
of tire DNA s~e~q u~e"n~c~e~s n ecessary for the artificial 
chromosome to participate in host cell replication 
and chromosome segregation. 



18 



17. A method of Claim 15 wherein the episomes present in 
eukaryotic host cells are artificial chromosomes and 
the artificial chromosomes additionally comprise all 
15 of the DNA sequences necessary for the artificial 

chromosome to participate in host cell replication 
and chromosome segregation. 



A method of Claim 13 wherein the target DNA fragment 
is c DNA . 



20 19. A method of Claim 13 wherein the target DNA fragment 
is selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 
sequences; mammalian genes; human genes; and plant 
genes . 



WO 93/03183 PCT/US91/08679 



-98« 



20. A method of Claim 15 wherein the target DNA fragment 
is selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 
sequences; mammalian genes; human genes; and plant 

5 genes . 

21. A method of Claim 16 wherein the selectable marker 
gene is selected from the group consisting of genes 
which confer a selectable phenotype on yeast cells 
and the selectable phenotype is selected from the 

— 10 group consisting of : aTrtib~i~o~tri~c resistance , 

nutrient prototrophy, tolerance to a* metal ion, 
ability to progress through the cell cycle and ex- 
pression of a cell surface marker. 

22. A method of Claim 17 wherein the selectable marker 
15 gene is selected from the group consisting of genes 

which confer a selectable phenotype on yeast cells 
and the selectable phenotype is selected from the 
group consisting of: antibiotic resistance, 
nutrient prototrophy, tolerance to a metal ion, 
20 ability to progress through the cell cycle and ex- 

pression of a cell surface marker. 

23. A DNA sequence, isolated by the method of Claim 19, 
selected from the group consisting of: mammalian 
DNA sequences; human DNA sequences; plant DNA 

25 sequences; mammalian genes; human genes; and plant 

genes . 
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24. 



A DNA sequence, isolated by the method of Claim 20 
selected from the group consisting of: mammalian ' 
DNA sequences; human DNA sequences; plant DNA 
sequences; mammalian genes; human genes; and plant 
genes . 
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26. 
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27. 



28 
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A method of Claim 13 wherein in step (b) the 
selectable marker gene for selection in yeast linked 
to targeting DNA is present in a replicating plasmid 
when introduced into the yeast cells and the 
-select^^^^,^^^ ^ target . ng flre 

subsequently released from the replicating pl asznid 
as a non-replicating molecule. 

A method of fragmenting human genomic DNA suitable 
for incorporation in a recombinant -DNA library which 
is to be used for mapping contiguous genomic DNA 
fragments, comprising digesting human genomic DNA 
with at least one restriction endonuclease selected 
from the group consisting of: A pal . Nsil and Seal 
thereby selecting against the occurrence of certain 
repetitive DNA sequences at the termini of the DNA 
fragments produced. 

Saccharomyces cerevisiae carrying a chromosomal 
deletion in four selectable marker genes. 

Saccharomyces cerevisiae of Claim 27 wherein the 
four selectable genes are ARG4 , TRP1, LEU2 and URA3 
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a) 
b) 
c) 
d) 



A YAC 



arm vector comprising: 
a yeast selectable marker gene; 
a bacterial origin of replication; 
a bacterial selectable marker gene; and 
a yeast telomere. 



31. 

10 
15 

32. 



The YAC arm vector of Claim 29 additionally com- 
prising a yeast replication origin, yeast centromere 
sequences or both. 



A YAC arm vector selected from the group consisting 
of: 

a) pTKENDA; 

b) pTKENDA2 ; 

c) pTKENDB ; 

d) pTKENDC; and 

e) pTKENDD . 

A yeast artificial chromosome comprising: 

a) two DNA sequences for replication in bacterial; 

b) two selectable marker genes for selection in 
bacteria ; 

c) two yeast telomere sequences; 

d) a yeast centromere; 

e) two selectable marker genes for selection in 
yeast; and 

f) one or more yeast origins or replication. 
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33. A DNA fragment library comprising a yeast strain 

carrying a chromosomal deletion of four selectable 
marker genes, the yeast strain having incorporated 
therein a pair of YAC arm vectors, the members of 
5 the pair referred to as a first YAC arm vector and a 

second YAC arm vector, each YAC arm vector com- 
prising: 

a) a yeast selectable marker gene which is one of 
the four selectable marker genes for which the 
10 yeast strain carries a chromosomal deletion; 

_b_) a_b_a_c_t_e_r_i.a.l o.r.i.g i.n_o.f repl-ica-tior.-; 

c) a bacterial selectable marker gene; and 

d) a yeast telomere, 

wherein a) one member of the pair of YAC arm vectors 
includes a DNA sequence which functions as a centro- 
mere in yeast; b) one or both of the YAC arm vectors 
includes an origin of replication functional in 
yeast; c) the first YAC arm vector and the second 
YAC arm vector are each ligated to a DNA fragment of 
non-yeast origin; and d) the first YAC arm vector 
and the second YAC arm vector each comprise a yeast 
selectable marker gene not present in the other 
member of the pair. 

The DNA fragment library of Claim 33 wherein the 
yeast strain carries a chromosomal deletion in the 
ARG4, TRP1, LEU2 and URA3 genes. 



20 
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25 



35. 



The DNA fragment library of Claim 34 wherein the DNA 
fragment of non-yeast origin is a mammalian DNA 
fragment or a plant DNA fragment. 
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36. The DNA fragment library of Claim 35 wherein the 
mammalian fragment is a human DNA fragment. 

37. A pair of DNA fragment libraries, comprising: 

a) a yeast strain carrying a chromosomal deletion 
5 of four selectable marker genes and having 

incorporated therein a pair of YAC arm vectors, 
the members of the pair referred to as a first 
YAC arm vector and a second YAC arm vector, 
each YAC arm vector comprising: 

1) a yeast selectable marker gene which is 
one of the four selectable marker genes 
for which the yeast strain carries a 
chromosomal deletion; 

2) a bacterial origin of replication; 

3) a bacterial selectable marker gene; and 

4) a yeast telomere, 
wherein a) one member of the pair of YAC arm vectors 
includes a DNA sequence which functions as a centro- 
mere in yeast; b) one or both of the YAC arm vectors 
includes an origin of replication functional in 
yeast; and c) the first YAC arm vector and the 
second YAC arm vector are each ligated to a DNA 
fragment of non-yeast origin; 

b) the yeast strain of a) having incorporated 

therein a second pair of YAC arm vectors, the 
members of the pair referred to as a third YAC 
arm vector and a fourth YAC arm vector, each 
arm vector comprising: 
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1) a yeast selectable marker gene which is 
one of the four selectable marker genes 
for which the yeast strain carries a 
chromosomal deletion ; 

2) a bacterial origin of replication; 

3) a bacterial selectable marker gene; and 

4) a yeast telomere, 
wherein a) one member of the second pair of YAC arm 
vectors includes a DNA sequence which functions as a 
centromere in yeast; b) one or both of the YAC arm 

vectors includes an ori g in of replication f_un.C-tior.a-l 

in yeast; and c) the third YAC arm vector and the 
fourth YAC arm vector are each ligated to a DNA 
fragment of non-yeast origin; and wherein the first, 
second, third and fourth YAC arm vectors each 
comprise a yeast selectable marker gene not present 
in the other YAC arm vectors. 

38. A DNA fragment library comprising a yeast strain 
carrying a chromosomal deletion of the ARG4 gene, 
the TRP1 gene, the LEU2 gene and the URA3 gene, the 
yeast strain having incorporated therein a pair of 
YAC arm vectors, each member of the pair ligated to 
a DNA fragment of non-yeast origin and the pair 
selected from the group consisting of: 
25 a) pTKENDA and pTKENDB; 

b) pTKENDA2 and pTKENDB; 

c) pTKENDA and pTKENDC ; 

d) pTKENDA2 and pTKENDC ; 

e) pTKENDC and pTKENDD; and 
30 f) pTKENDB and pTKENDD . 
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39. The DNA frag„ent library of Claim 38 wherein the DNA 
fragment of non-yeast origin is a mammalian DNA 
fragment or a plant DNA fragment. 

AO. The DNA fragment l ibrary of claim 39 wherein ^ ^ 

fragment of mammalian origin is a human DNA 
fragment. 
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Oligonucleotide 

12 GGTCTCTACA GGTTCTGACA TTATT 

13 CCGGCGTAGA GAATCCACAG GACGG 

14 CTCCTGATGA CGCATGGTTA CTC 

15 GGAAAGAAAT GCACAAGCTT TTGCC 

16 CCGATACCAG GACCTTGCCA TCC 
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